PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

C12N 15/01, 15/11, 15/12, 15/33, 15/62, 
15/70, C07K 14/00, 14/005, 14/435, 
19/00, A61K 38/16, 38/17, C12Q 1/02, 
1/68, 1/70 ; 



Al 



(11) International Publication Number: 
(43) International Publication Date: 



WO 95/19431 

20 July 1995 (20.07.95) 



(21) International Application Number: PCT/US95/00829 

(22) International Filing Date: 18 January 1995 (18.01.95) 



(30) Priority Data: 
08/183,119 
08/312,604 



18 January 1994 (18.01.94) US 
28 September 1994 (28.09.94) US 



(60) Parent Application or Grant 
(63) Related by Continuation 
US 

Filed on 



08/312,604 (CIP) 
28 September 1994 (28.09.94) 



(71) Applicant (for all designated States except US): THE SCRIPPS 

RESEARCH INSTITUTE [US/US]; 10666 North Toirey 
Pines Road, La Jolla, CA 92037 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): BARBAS, Carlos, F.. ID 
[US/US1; 7081 Weller Street; San Diego, CA 92122 (US). 
GOTTESFELD, Joel, M. [US/US]; 1 4269 Mango Drive, San 
Diego, CA 92122 (US). WRIGHT, Peter, E. [US/US]; 7221 
Rue Michael, La Jolla, CA 92037 (US). 



(74) Agents: HAILE, Lisa, A. et al.; Spensley, Horn, Jubas & 
Lubitz, 5th floor, 1880 Century Park East, Los Angeles, 
CA 90067 (US). 



(81) Designated States: AU, CA, FI, JP, NO, US, European patent 
(AT. BE, CH, DE, DK. ES, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE). 



Pubhshed 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: ZINC FINGER PROTEIN DERIVATIVES AND METHODS THEREFOR 
(57) Abstract 

An assay is described which allows for identification of novel modulating zinc finger-nucleoUde binding j"™ 
polypeptides. 



Codes used to identify 
applications under the PCT. 

AT Austria 

AU Australia 

BB Barbados 

BE Belgium 

BF Burkina Faso 

BG Bulgaria 

BJ Benin 

BR Brazil 

BY Belarus 

CA Canada 

CF Central African Republic 

CG Congo 

CH Switzerland 

CI C6te d'lvoire 

CM Cameroon 

CN China 

CS Czechoslovakia 

CZ Czech Republic 

DE Germany 

DK Denmark 

ES Spain 

H Finland 

FR France 

GA Gabon 



FOR THE PURPOSES OF INFORMATION ONLY 

States party to the PCT on the front pages of pamphlets publishing international 



GB 

GE 

GN 

GR 

HU 

IE 

IT 

JP 

KE 

KG 

KF 

KR 

KZ 

LI 

LK 

LU 

LV 

MC 

MD 

MG 

ML 

MN 



United Kingdom 

Georgia 

Guinea 

Greece 

Hungary 

Ireland 

Italy 



Kenya 
Kyrgystan 

Democratic People's Republic 
of Korea 

Republic of Korea 

Kazakhstan 

Liechtenstein 

Sri Lanka 

Luxembourg 

Latvia 

Monaco 

Republic of Moldova 

Madagascar 

Mali 

Mongolia 



MR 


Mauritania 


MW 


Malawi 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


FT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 



WO 95/19431 PCT/US95/00829 

-1- 

ZINC FINGER PROTEIN DERIVATIVES AND METHODS THEREFOR 

This application is a continuation-in-part of application Serial No. 08/312,604, filed 
September 28, 1 994, which is a continuation-in-part of application Serial No. 08/1 83, 1 1 9, 
filed January 18, 1994. 

5 BACKGROUND OF THE INVENTION 
1 . Field of the Invention 

This invention relates generally to the field of regulation of gene expression and 
specifically to methods of modulating gene expression by utilizing polypeptides derived 
from zinc finger-nucleotide binding proteins. 

10 2. Description of Related Art 

Transcriptional regulation is primarily achieved by the sequence-specific binding of 
proteins to DNA and RNA. Of the known protein motifs involved in the sequence 
specific recognition of DNA, the zinc finger protein is unique in its modular nature. To 
date, zinc finger proteins have been identified which contain between 2 and 37 modules. 

1 5 More than two hundred proteins, many of them transcription factors, have been shown to 
possess zinc fingers domains. Zinc fingers connect transcription factors to their target 
genes mainly by binding to specific sequences of DNA base pairs - the "rungs" in the 
DNA "ladder". 

Zinc finger modules are approximately 30 amino acid-long motifs found in a wide variety 
20 of transcription regulatory proteins in eukaryotic organisms. As the name implies, this 
nucleic acid binding protein domain is folded around a zinc ion. The zinc finger domain 
was first recognized in the transcription factor TFIIIA from Xenopus oocytes (Miller, et 
al t EMBO, 4:1609-1614, 1985; Brown, et al, FEBS Lett, lM:271-274, 1985). This 
protein consists of nine imperfect repeats of a consensus sequence: 
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CTyr, Ph.>X<:ys-X„.C yS -X J -Pb=-X ! .L=u-X I -H b -X )J -His-X„ (SEQ ID NO: 1) 
where X is any amino acid. 

Lta mnA, m os t z i »cf m6 e r p ro ,^ ta v eOTKOT =dcy S « iK andhis«d in e™s i du e ste 
*e single zinc * in each finger domain. The secure o 
5 individual zinc finger peptides of fins type (containing two cysteines and two mandates) 
such as .hose fonnd in the yeas, prmein ADR1, the human male associaten proton ZFY 
me mv enhance, p,o,ein and *. X— P-in Xfin have b- >*0, htgh 
notation NMR methods (Kochoyan, « A M-»>* * J*"' 
Omichinski, « A M-»» 22:9324-9334, .990; Lee, « of.. Science, 245:635-6,7, 

,0 1989) and de«ai.ed models for fire interaction of zinc fingers and DMA have been 
pro pos.d(B.rg,.9 88 ;Berg,.990iChn rc Wl.,« O M990).Moreove,,mesm.cm,eofa 

Le finaer polypepdde-DNA complex derived from the mouse immediate early protetn 

Paho, Science. 212:809-817, 199!). Each finger conttins an antiparallel p-rum, a finger 

in fire maior groove o, DN A. ,n addition, the conserved hydrophobic antino actds and 
^ coordination hy the cystetae and histidine residues stabilize fire stnacrure of me 
individual finger domain. 

While five prototype rinc finger prorein TFH1A contains an array of nine zinc fingers 
Cass OCrox-20, Sp., for example) contain only fitree zinc fingers within a much larger 
^fion^ence-Mo^of^DNAc^n^hyziOoSareMm^^ 

J^nine residues on one OHA sfrand in Ore major groove of the 

contrast, fite mechanism of THHA binding to DNA is more complex. The anmo 
^3rincfi J ,gers^gmzea 1 3bpse,uenee^b to dmuzen MJ org^ve^ 

,„ rif268, rhese fingers a)so mahe *ranine contact primarily on one snand of theDNA 
Uulikc the *068 ciaas of proteins, zinc fingers 4 and 6 of THOA each bmd etdter . 



20 



25 
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across the minor groove, bringing fingers 5 and 7 through 9 back into contact with the 
major groove (Clemens, et al, Proc. Natl Acad. Sci. USA, 52:10822-10826, 1992). 

The crystal structure of zif268, indicates that specific histidine (non-zinc coordinating his 
residues) and arginine residues on the surface of the a-helix participate in DNA 
5 recognition. Specifically, the charged amino acids immediately preceding the a -helix and 
at helix positions 2, 3, and 6 (immediately preceding the conserved histidine) participate 
in hydrogen bonding to DNA guanines. Similar to finger 2 of the regulatory protein 
Krox-20 and fingers 1 and 3 of Spl, finger 2 of TFIIIA contains histidine and arginine 
residues at these DNA contact positions; further, each of these zinc fingers minimally 

1 0 recognizes the sequence GGG. Finger swap experiments between transcription factor Sp 1 
and Krox-20 have confirmed the 3-bp zinc finger recognition code for this class of finger 
proteins (Nardelli, et al, Nature, 242:175-178, 1989). Mutagenesis experiments have 
also shown the importance of these amino acids in specifying DNA recognition. It would 
be desirable to ascertain a simple code which specifies zinc finger-nucleotide recognition. 

15 If such a code could be deciphered, then zinc finger polypeptides might be designed to 
bind any chosen DNA sequence. The complex of such a polypeptide and its recognition 
sequence might be utilized to modulate (up or down) the transcriptional activity of the 
gene containing this sequence. 

Zinc finger proteins have also been reported which bind to RNA. Clemens, et al, 
20 {Science, 260:530, 1 993) found that fingers 4 to 7 of TFIIIA contribute 95% of the free 
energy of TFIIIA binding to 5S rRNA, whereas fingers 1 to 3 make a similar contribution 
in binding the promoter of the 5S gene. Comparison of the two known 5S RNA binding 
proteins, TFIIIA and p43, reveals few homologies other than the consensus zinc ligands 
(C and H), hydrophobic amino acids and a threonine-tryptophan-threonine triplet motif 
25 in finger 6. 

In order to redesign zinc fingers, new selective strategies must be developed and 
additional information on the structural basis of sequence-specific nucleotide recognition 
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„ required. Current prorata engineering efforrs utilize design snaugies ba*d on 
« and/or sudoral ana,ogy. While such a sh.tegy may be sufficient for the 
JLfer of monfc, U „mi.s the ability to produce novel nucleotide burding nrottfs not 

6 ton e t »iU,onlymodes. S uccess( D =siarlaisandB=rg,^,12:10.,1992). 

As a consequence, there exists a need for new suaregies for designing additional zinc 
flngcra with specific recognition sites as we,, as nove, rinc fingers for enhancing or 
repressing gene expression. The present invention fulfills this need. 
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SUMMARY OF THE INVENTION 

The invention provides an isolated zinc fmger-nucleotide binding polypeptide variant 
comprising at least two zinc finger modules that bind to a cellular nucleotide sequence 
and modulate the function of the cellular nucleotide sequence. The variant binds to either 
5 DNA or RN A and may enhance or suppress transcription from a promoter or from within 
a structural gene. The cellular nucleotide sequence may be a sequence which is a 
naturally occurring sequence in the cell, or it may be a viral-derived nucleotide sequence 
in the cell. 

In another embodiment, the invention provides a pharmaceutical composition comprising 
10 a therapeutically effective amount of a zinc finger-nucieotide binding polypeptide 
derivative or a therapeutically effective amount of a nucleotide sequence which encodes 
a zinc fmger-nucleotide binding polypeptide derivative, wherein the derivative binds to 
a cellular nucleotide sequence to modulate the function of the cellular nucleotide 
sequence, in combination with a pharmaceutically acceptable carrier. 

15 In a further embodiment, the invention provides a method for inhibiting a cellular 
nucleotide sequence comprising a zinc fmger-nucleotide binding motif, the method 
comprising contacting the motif with a zinc fmger-nucleotide binding polypeptide 
derivative which binds the motif. 

In yet a further embodiment, the invention provides a method for obtaining an isolated 
20 zinc fmger-nucleotide binding polypeptide variant which binds to a cellular nucleotide 
sequence comprising identifying the amino acids in a zinc fmger-nucleotide binding 
polypeptide that bind to a first cellular nucleotide sequence and modulate the function of 
the nucleotide sequence; creating an expression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified; expressing the library 
25 in a suitable host cell; and isolating a clone that produces a polypeptide variant that binds 
to a second cellular nucleotide sequence and modulates the function of the second 
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nucleotide sequence. Preferably, the expression library encoding the polypeptide variant 
is a phage display library. 

The invention also provides a method of treating a subject with a cell proliferative 
disorder, wherein the disorder is associated with the modulation of gene express.cn 
associated with a zinc finger-nucleotide binding motif, comprising contacting the zmc 
finger-nucleotide binding motif with an effective amount of a zinc finger-nucleoUde 
binding polypeptide derivative that binds to the zinc finger-nucleotide binding mouf to 
modulate activity of the gene. 

Further, the invention provides a method for identifying a protein which modulates the 
function of a cellular nucleotide sequence and binds to a zinc finger-nucleotide binding 
motif comprising incubating components comprising a nucleotide sequence encodmg the 
putative modulating protein operably linked to a first inducible promoter, and a reporter 
gene operably linked to a second inducible promoter and a zinc finger-nucleoude bmdmg 
mo tif, wherein the incubating is carried out under conditions sufficient to allow the 
components to interact; and measuring the effect of the putative modulating protein on 
the expression of the reporter gene. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 shows a model for the interaction of the zinc fingers of TFHIA with the 
internal promoter of the 5S RNA gene. 

FIGURE 2A shows the amino acid sequence of the first three amino terminal zinc fingers 
5 ofTFIIIA. 

FIGURE 2B shows the nucleotide sequence of the minimal binding site for zf 1-3. 



FIGURE 3 shows a gel mobility shift assay for the binding of zfl-3 to a 23 bp 32 P-labeled 
double stranded oligonucleotide. 

FIGURE 4 shows an autoradiogram of in vitro transcription indicating that zfl-3 blocks 
1 0 transcription by T7 RNA polymerase. 

FIGURE 5 shows binding of zfl-3 to its recognition sequence blocks transcription from 
a T7RNA polymerase promoter located nearby. A plot of percent of DNA molecules 
bound by zfl-3 in a gel mobility shift assay (x-axis) is plotted against percent inhibition 
of T7RNA polymerase transcription (y-axis). 

15 FIGURE 6 is an autoradiogram showing zfl-3 blocks eukaryotic RNA polymerase III 
transcription in an in vitro transcription system derived from unfertilized Xenopus eggs. 

FIGURE 7 shows the nucleotide and deduced amino acid sequence for the zinc fingers 
of zif268 which were cloned in pComb 3.5. 
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FIGURE 8 shows the amino acid sequence of the Zif268 protein and the hairpin DNA 
used for phage selection. (A) shows the conserved features of each zinc finger. (B) 
shows the hairpin DNA containing the 9-bp consensus binding site. 

FIGURE 9 is a table listing of the six randomized residues of finger 1, 2, and 3. 

FIGURE 10 shows an SDS-PAGE of Zif268 variant A14 before IPTG induction (lane 2); 
after IPTG induction (lane 3); cytoplasmic fraction after removal of inclusion bodies (lane 
4); inclusion bodies containing zinc finger peptide (lane 5); and mutant Zif268 (lane 6). 
Lane 1 is MW Standards (kD). 

FIGURE 11 is a table indicating k m , association rate; * off , dissociation rate; and K d 
1 0 equilibrium dissociation constant, for each protein. 

FIGURE 12 shows dissociation rate (U of wild-type Zif268 protein (WT) (□) and its 
variant C7 (o), by real-time changes in surface plasmon resonance. 

FIGURE 13A and B show the nucleotide and amino acid sequence of Zif268-Jun (SEQ 
IDNOS:33 and 34). 

15 FIGURE 14A and B show the nucleotide and amino acid sequence of Zif268-Fos (SEQ 
IDNOS:35 and 36). 

FIGURE 15 shows the nucleotide and amino acid sequence of the three finger 
construction of C7 zinc finger (SEQ ID NOS: 41 and 42). 

FIGURE 16A and B show the nucleotide and amino acid sequence of Zif268-Zif268 
20 linked by a TGEKP linker (SEQ ID NOS: 43 and 44). 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides an isolated zinc finger-nucleotide binding polypeptide 
variant comprising at least two zinc finger modules that bind to a cellular nucleotide 
sequence and modulate the function of the cellular nucleotide sequence. The polypeptide 
5 variant may enhance or suppress transcription of a gene, and may bind to DNA or RNA. 
In addition, the invention provides a pharmaceutical composition comprising a 
therapeutically effective amount of a zinc finger-nucleotide binding polypeptide 
derivative or a therapeutically effective amount of a nucleotide sequence that encodes a 
zinc finger-nucleotide binding polypeptide derivative, wherein the derivative binds to a 
1 0 cellular nucleotide sequence to modulate the function of the cellular nucleotide sequence, 
in combination with a pharmaceutically acceptable carrier. The invention also provides 
a screening method for obtaining a zinc finger-nucleotide binding polypeptide variant 
which binds to a cellular or viral nucleotide sequence. 

A zinc finger-nucleotide binding polypeptide "variant" refers to a polypeptide which is 
15 a mutagenized form of a zinc finger protein or one produced through recombination. A 
variant may be a hybrid which contains zinc finger domain(s) from one protein linked to 
zinc finger domain(s) of a second protein, for example. The domains may be wild type 
or mutagenized. A "derivative" includes a truncated form of a wild type zinc finger 
protein, which contains less than the original number of fingers in the wild type protein. 
20 A derivative also includes variant zinc finger polypeptides. Examples of zinc finger- 
nucleotide binding polypeptides from which a derivative or variant may be produced 
include TFIHA and zif268. 

As used herein a "zinc finger-nucleotide binding motif* refers to any two or three- 
dimensional feature of a nucleotide segment to which a zinc finger-nucleotide binding 
25 derivative polypeptide binds with specificity. Included within this definition are 
nucleotide sequences, generally of five nucleotides or less, as well as the three 
dimensional aspects of the DNA double helix, such as the major and minor grooves, the 
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face of tine helix, and .he .ike. The motif is^ally any sequence of saiuhle lengthro 

t0 a motive* having abou, 9 to abou, 14 base pairs. Therefore, rhe invention 
provides ainc finger-nucleotide binding polypeptides of any specificiry, and Ure amc 

Jger prorein bbrds. Tire motif may be fonnd in any DNA or RNA se q uenc=. mcludrng 
regulamry sequences, exons, inrrons, or any non-coding sequence. 

,„ dre practice of mis urvention i. is nor necessary ma, the zinc finger-nucleotide binding 
m „,if be known in order • obrain a zincfinger nucleotide binding varian, poiypepnde. 
Alrhough ainc finger proreins have so far been identified only in eukaryores, ,« » 
apecificafiy comemplared wirhin me soope otitis invention mar zinc fmger-nucleo de 
Wnding motifs can be identified in non-eukaryotic DNA or ENA, especially in me * 
promorers of bacreria and viruses by me binding tirerero of tire geneucall, mod, fied 

ofmezincfinger.burdifferfiomaincfingerproreir.foundinnanarebymenmerbodof 
production, as well as rheir amino acid sequences and mree-dimensiona, srrucrure, 

The characrenstic smrcrure of me known wild we ainc finger proteins are made up of 
^^masmanyaaSTmodu.ar^emmpenm.wameachrepeatforn.mgnfi^ 
Wdingarmcarommre^ecWcoordmationbymeansofapairofconservedcysrem^ 

ar.a a pair of conserved histidines. Generally each finger also conrains conserve 
maintain its shape. 

lw „ ainc finger moduias Una, bind ,o a cellular nueleotide sequence and moduk* £ 
taction of rhe cefioUr nucleotide sequence. The rema "ceUuiar nuclide sequ^e 
refers to a nucleotide sequence which is presen, widrin me caU. .« is no, necess^y ft. 
te sequence be a narurally occurring sequence of me ee». For example, a refiovual 
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genome which is integrated within a host's cellular DNA ? would be considered a "cellular 
nucleotide sequence". The cellular nucleotide sequence can be DNA or RNA and 
includes both introns and exons. The cell and/or cellular nucleotide sequence can be 
prokaryotic or eukaryotic, including a yeast, virus, or plant nucleotide sequence. 

5 The term "modulate" refers to the suppression, enhancement or induction of a function. 
For example, the zinc finger-nucleotide binding polypeptide variant of the invention may 
modulate a promoter sequence by binding to a motif within the promoter, thereby 
enhancing or suppressing transcription of a gene operatively linked to the promoter 
cellular nucleotide sequence. Alternatively, modulation may include inhibition of 
1 0 transcription of a gene where the zinc finger-nucleotide binding polypeptide variant binds 
to the structural gene and blocks DNA dependent RNA polymerase from reading through 
the gene, thus inhibiting transcription of the gene. The structural gene may be a normal 
cellular gene or an oncogene, for example. 

The promoter region of a gene includes the regulatory elements that typically lie 5' to a 
1 5 structural gene. If a gene is to be activated, proteins known as transcription factors attach 
to the promoter region of the gene. This assembly resembles an "on switch" by enabling 
an enzyme to transcribe a second genetic segment from DNA into RNA. In most cases 
the resulting RNA molecule serves as a template for synthesis of a specific protein; 
sometimes RNA itself is the final product. 

The promoter region may be a normal cellular promoter or, for example, an onco- 
promoter. An onco-promoter is generally a virus-derived promoter. For example, the 
long terminal repeat (LTR) of retroviruses is a promoter region which may be a target for 
a zinc finger binding polypeptide variant of the invention. Promoters from members of 
the Lentivims group, which include such pathogens as human T-cell lymphotrophic virus 
(HTLV) 1 and 2, or human immunodeficiency virus (HIV) 1 or 2, are examples of viral 
promoter regions which may be targeted for transcriptional modulation by a zinc finger 
binding polypeptide of the invention. 
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Th e *tc ftngermuOeotide binding p-«* derivatives - — 
Mode poiypcpUdcs *. bind . a ce.ta.ar nucleotide sequence such as DNA, FN A o r 

B *. w-*.*— «»- ™ am.no act, ««— 

TGEK(P) (SEQ ID NO: 32) is typical., indicative of zinc fmge, domains w„ch b.nd 
a DNA sequence. .Wore, one can define whether a pan.cn.ar zinc ftnger 
ooCeodde butding po,ypepude preferab.y binds .0 DNA or RNA by exammatton of .he 
linker amino acids. 

10 ID one embodhnen, a merhod of Ute invendon inc.udes a method for 

jessing the funcrion of a cc.,u.az nuc.eotide sequence compnstng a ,„c fln 6 er 

^bindstouremorif. .nurecase where uneeumarnuclcodde sequence rsapronrorer 

a2tocito g«:DNAbh«iu,gn Wt if.The« ra ,^bidng"^ers«oO K supp^onof^ 
of Lvauon of —ion of a statural gene ope ra b, y Knxed ,o a promo«er 
eomainhrg a zhrc ftnger-nucleodde binding motif, for =xanrp.e. .n addlbon, d* « 

20 or within an RNA sequence. 

^ "effecdvc amount" mc.„des tha, — which resuUs in the deactivation of a 
previ ous,y acdvated promote, or ,ha t amount which — «. *. 

Leripdon of a stmctum, gene or trans.at.on of RNA. «. amount of amc ft£ 
25 oeriveul.eot.de binding po.vpep.ide required .a that — ~ ' " ^ 
o.sp,ace a nauve zinc fmge,nue,eo,,de b.nd.» 6 pro,e.n in an existtng pro.e,n/pmm*er 
Jlex or that amoun, necessary . compete with the native zino Bnger-nuOeoude 
:«m to form a cotnp,* with the promoter .tse.f. Shndarly. me amoun. 
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required to block a structural gene or RN A is that amount which binds to and blocks RNA 
polymerase from reading through on the gene or that amount which inhibits translation, 
respectively. Preferably, the method is performed intracellular^. By functionally 
inactivating a promoter or structural gene, transcription or translation is suppressed. 
5 Delivery of an effective amount of the inhibitory protein for binding to or "contacting" 
the cellular nucleotide sequence containing the zinc finger-nucleotide binding protein 
motif, can be accomplished by one of the mechanisms described herein, such as by 
retroviral vectors or liposomes, or other methods well known in the art. 

The zinc finger-nucleotide binding polypeptide derivative is derived or produced from a 
1 0 wild type zinc finger protein by truncation or expansion, or as a variant of the wild type- 
derived polypeptide by a process of site directed mutagenesis, or by a combination of the 
procedures. 

The term "truncated" refers to a zinc finger-nucleotide binding polypeptide derivative that 
contains less than the full number of zinc fingers found in the native zinc finger binding 

1 5 protein or that has been deleted of non-desired sequences. For example, truncation of the 
zinc finger-nucleotide binding protein TFHIA, which naturally contains nine zinc fingers, 
might be a polypeptide with only zinc fingers one through three. Expansion refers to a 
zinc finger polypeptide to which additional zinc finger modules have been added. For 
example, TFIIIA may be extended to 12 fingers by adding 3 zinc finger domains. In 

20 addition, a truncated zinc finger-nucleotide binding polypeptide may include zinc finger 
modules from more than one wild type polypeptide, thus resulting in a "hybrid" zinc 
finger-nucleotide binding polypeptide. 

The term "mutagenized" refers to a zinc finger derived-nucleotide binding polypeptide 
that has been obtained by performing any of the known methods for accomplishing 
25 random or site-directed mutagenesis of the DNA encoding the protein. For instance, in 
TFIIIA, mutagenesis can be performed to replace nonconserved residues in one or more 
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„ Truncated zinc finger-nucleotide binding 
of the repeats of the consensus sequence. Truncated zinc g 

proteins can also be mutagenized. 

Ex3mp.es of town ainc fi„g=r-nuc,eo,ide binding proteins .ha, can be ***** 
of skill in the art. 

^Tsmoun, of a ahrc fnger-nuOeoride binding po.vp«P«de denvabve or a 

„ uc ,eo«de binding po.ypep.ide derivanve, wherem the denvanve 
„ u c,eo,ide se.uenee ,o module taction of «b« * — 
comW na,ion whh a pharmaceuuc* accep«ab,e earner. "~ ~ 
training one or more of -be differen, ainc fi „g t ,nuc.eo,,de b,nd,ng 
„ described herein are useful in the therapeuUc metads of dre mvenUon. 

u ~f ac thev refer to compositions, earners, anuenib «i 
^« mm atical variations thereof, as they reier iu ^ 

a dminisua.ion to or upon a human wrthou. P 
20 effects sueh as nausea, dizziness, gastrie upset and the hke wh.cn wo 
ft* would prohibit administration of the compostaon. 

Tbe nrenaration of a phannacologica, composition that contains active ingredients 
„ prepaid as Sterne h^es eitar as solurions or auspens, 
,„ use can also be prepared. The preparation can also be emu.stf.ei. 
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The active ingredient can be mixed with excipients which are pharmaceutically acceptable 
and compatible with the active ingredient and in amounts suitable for use in the 
therapeutic methods described herein. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol or the like and combinations thereof. In addition, if desired, 
5 the composition can contain minor amounts of auxiliary substances such as wetting or 
emulsifying agents, as well as pH buffering agents and the like which enhance the 
effectiveness of the. active ingredient. 

The therapeutic pharmaceutical composition of the present invention can include 
pharmaceutically acceptable salts of the components therein. Pharmaceutically acceptable 

10 salts include the acid addition salts (formed with the free amino groups of the 
polypeptide) that are formed with inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids as acetic, tartaric, mandelic and the like. Salts 
formed with the free carboxyl groups can also be derived from inorganic bases such as, 
for example, sodium, potassium, ammonium, calcium or ferric hydroxides, and such 

15 organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, 
procaine and the like. 

Physiologically tolerable carriers are well known in the art. Exemplary of liquid carriers 
are sterile aqueous solutions that contain no materials in addition to the active ingredients 
and water, or contain a buffer such as sodium phosphate at physiological pH value, 
20 physiological saline or both, such as phosphate-buffered saline. Still further, aqueous 
carriers can contain more than one buffer salt, as well as salts such as sodium and 
potassium chlorides, dextrose, propylene glycol, polyethylene glycol and other solutes. 

Liquid compositions can also contain liquid phases in addition to and to the exclusion of 
water. Exemplary of such additional liquid phases are glycerin, vegetable oils such as 
25 cottonseed oil, organic esters such as ethyl oleate, and water-oil emulsions. 
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» invention includes . nuc.eot.de sequence encoding a zinc finser-mK.eofide binding 
po.ypepfide vanan, DNA sequences encoding fine zinc f,„ge.-nuc,eo„de btndtng 

LIL by Severn, tnefimr. For examp.e, *• ™ - * 
5 hybridan procedurcs which are we,, known in the an. These mOude bu, are no 
.Led tot (0 hybfidizauon of probes to genomic or cDNA .ibranes to etec, ahared. 
nU c,eo,ide sequences; (2, anubody screening of expression fibres ,o de.ee, sh^ed 
stntctuza, features; and (3) synthesis by the polymer chain reactton (PGR). RNA 

1 0 Ctrrmtr fnxocofc in Molecular Biology, Ansubel, er ol eds, 1989). 

The deve,optnent of specific DNA sequences encoding zinc finger-nucleofide biding 
paoteins of fire invention can be obtained by: (!) isolafion of a doub,e-sma»ded DN A 
equence from fire genonrie DNA; <2> chemical manufacture of a DNA sequenca ,0 
provide the necessary codons for the polypeptide of interest; and (3) in vitro synthesis of 
V doubled DNA saqueoce by reverse .anseHpfion of mRNA ^ 

encode donor ce„. .nfite inner ease, a do„b,e-s«nded DN A eonspiemen, of n^A 
fctLaHy fonned which is generai.y referred to as cDNA. Of mese three methods fo 

For obraining zinc finger denved-DNA binding po,ypepfides, fire synthesis of DNA 
fences is frequeofiy me ntefirod of choice when. fine, entire sequence of amrno^ 

acid residues of fine desired polypepfid. is no. known, me direc, synmesta of DNA 
25 ^ces^^b.eand^n^odofch.iee^fomrafionofcDNAse,^ 

^.lid-carrying cDNA Ubraries which are derived from reverse —on of 
I, whic" abundant in donor cefis fita, te ve a high ieve, of genetic expresston. 
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When used in combination with polymerase chain reaction technology, even rare 
expression products can be cloned. In those cases where significant portions of the amino 
acid sequence of the polypeptide are known, the production of labeled single or double- 
stranded DNA or RNA probe sequences duplicating a sequence putatively present in the 
5 target cDNA may be employed in DNA/DNA hybridization procedures which are carried 
out on cloned copies of the cDNA which have been denatured into a single-stranded form 
(Jay, etal, Nucleic Acid Research, 11:2325, 1983). 

Hybridization procedures are useful for the screening of recombinant clones by using 
labeled mixed synthetic oligonucleotide probes where each probe is potentially the 

10 complete complement of a specific DNA sequence in the hybridization sample which 
includes a heterogeneous mixture of denatured double-stranded DNA. For such 
screening, hybridization is preferably performed on either single-stranded DNA or 
denatured double-stranded DNA. Hybridization is particularly useful in the detection of 
cDNA clones derived from sources where an extremely low amount of mRNA sequences 

15 relating to the polypeptide of interest are present. By using stringent hybridization 
conditions directed to avoid non-specific binding, it is possible, for example, to allow the 
autoradiographic visualization of a specific cDNA clone by the hybridization of the target 
DNA to that single probe in the mixture which is its complete complement (Wallace, et 
al, Nucleic Acid Research, 2:879, 1981; Maniatis, et al, Molecular Cloning: A 

20 Laboratory Manual, Cold Spring Harbor Laboratory, 1 982). 

Screening procedures which rely on nucleic acid hybridization make it possible to isolate 
any gene sequence from any organism, provided the appropriate probe is available. 
Oligonucleotide probes, which correspond to a part of the sequence encoding the protein 
in question, can be synthesized chemically. This requires that short, oligopeptide 
25 stretches of amino acid sequence must be known. The DNA sequence encoding the 
protein can be deduced from the genetic code, however, the degeneracy of the code must 
be taken into account. It is possible to perform a mixed addition reaction when the 
sequence is degenerate. This includes a heterogeneous mixture of denatured double- 
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stranded DNA. For such screening, hybridization is preferably performed on either 
single-stranded DNA or denatured double-stranded DNA. 

Since the DNA sequences of the invention encode essentially all or part of an zinc finger- 
nucleotide binding protein, it is now a routine matter to prepare, subclone, and express 

5 thetnmcatexipoly^ 

Alternatively, by utilizing the DNA fragments disclosed herein which define the zinc 
f mg er-nucleotide binding polypeptides of the invention it is possible, in conjunction W th 
known techniques, to determine the DNA sequences encoding the entire zmc fmger- 
nucleotide binding protein. Such techniques are described in U.S. 4,394,443 and U.S. 

10 4,446,23 5 which are incorporated herein by reference. 

A cDNA expression library, such as lambda gtl 1, can be screened indirectly for zinc 
fing^-nucleotideb^ 

oneepitopcusingantibodies specific forthe zinc fmger-nucleotide binding protem. Such 
antibodies can be either polyclonally or monoclonally derived and used to detect 
15 expression product indicative of the presence of zinc fmger-nucleotide binding protem 
cDNA. Alternatively, binding of the derived polypeptides to DNA targets can be assayed 
by incorporated radiolabeled DNA into the target site and testing for retardatmn of 
electrophoretic mobility as compared with unbound target site. 

A preferred vector used for identification of truncated and/or mutagenized zinc finger- 
20 nucleotide binding polypeptides is a recombinant DNA (rDNA) molecule contaimng a 
nucleotide sequence that codes for and is capable of expressing a fusion polypeptide 
containing, in the direction of amino- to carboxy-terminus, (1) a prokaryotic secretion 
signal domain, (2) a heterologous polypeptide, and (3) a filamentous phage membrane 
anchordomain. The vector includes DNA expression control sequences for expressing 

25 the fusion polypeptide, preferably prokaryotic control sequences. 
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The filamentous phage membrane anchor is preferably a domain of the cpIII or cpVIII 
coat protein capable of associating with the matrix of a filamentous phage particle, 
thereby incorporating the fusion polypeptide onto the phage surface. 

The secretion signal is a leader peptide domain of a protein that targets the protein to the 
5 periplasmic membrane of gram negative bacteria. A preferred secretion signal is a pelB 
secretion signal. The predicted amino acid residue sequences of the secretion signal 
domain from two pelB gene product variants from Envinia carotova are described in Lei, 
etal. {Nature, 311:543-546, 1988). 

The leader sequence of the pelB protein has previously been used as a secretion signal for 
ftision proteins (Better, et al, Science, 240:1041-1043, 1988; Sastry, et al, Proc. Natl 
Acad. ScL USA, 8^:5728-5732, 1989; and Mullinax, et al, Proc. Natl Acad. Sex. USA, 
87:8095-8099, 1990). Amino acid residue sequences for other secretion signal 
polypeptide domains from E. colt 

useful in this invention can be found in Oliver, In Neidhard, F.C. (ed.), Escherichia coli 
and Salmonella Typhimurium, American Society for Microbiology, Washington, D.C., 
1:56-69 (1987). 

Preferred membrane anchors for the vector are obtainable from filamentous phage Ml 3 ? 
fl, fd, and equivalent filamentous phage. Preferred membrane anchor domains are found 
in the coat proteins encoded by gene III and gene VIII. The membrane anchor domain 
20 of a filamentous phage coat protein is a portion of the carboxy terminal region of the coat 
protein and includes a region of hydrophobic amino acid residues for spanning a lipid 
bilayer membrane, and a region of charged amino acid residues normally found at the 
cytoplasmic face of the membrane and extending away from the membrane. In the phage 
fl, gene VIII coat protein's membrane spanning region comprises residue Trp-26 through 
25 Lys-40, and the cytoplasmic region comprises the carboxyterminal 1 1 residues from 41 
to 52 (Ohkawa, et aL, J. Biol Chem., 25^:9951-9958, 1981): Thus, the amino acid 
residue sequence of a preferred membrane anchor domain is derived from the Ml 3 
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filamentous phage gene VIII coat protein (also designated cpVIII or CP 8). Gene VIII 
coat protein is present on a mature filamentous phage over the majority of the phage 
particle with typically about 2500 to 3000 copies of the coat protein. 

In addition, the amino acid residue sequence of another preferred membrane anchor 
5 domain is derived from the M13 filamentous phage gene III coat protein (also designated 
cpffl). Gene III coat protein is present on a mature filamentous phage at one end of the 
phage particle with typically about 4 to 6 copies of the coat protein. For detailed 
descriptions of the structure of filamentous phage particles, their coat proteins and particle 
assembly, see the reviews by Rached, et al. {Microbiol Rev., 10:401-427 1986; and 
1 0 Model, e, al., in "The Bacteriophages: Vol. T, R. Calendar, ed. Plenum Publishing Co., 
pp. 375-456, 1988). 

DNA expression control sequences comprise a set of DNA expression signals for 
expressing a structural gene product and include both 5' and 3' elements, as is well known, 
operatively linked to the cistron such that the cistron is able to express a structural gene 
15 product. The 5' control sequences define a promoter for initiating transcription and a 
ribosome binding site operatively linked at the 5' terminus of the upstream translatable 
DNA sequence. 

To achieve high levels of gene expression in E. coli, it is necessary to use not only strong 
promoters to generate large quantities of mRNA, but also ribosome binding sites to ensure 

20 that the mRNA is efficiently translated. In E. coli, the ribosome binding site includes an 
initiation codon (AUG) and a sequence 3-9 nucleotides long located 3-11 nucleotides 
upstreamfrom the initiation codon (Shine, e, al., Nature, m^, 1975). The sequence, 
AGGAGGU, which is called the Shine^Dalgamo (SD) sequence, is complementary to the 
3' end of E. coli 1 6S rRNA. Binding of the ribosome to mRNA and the sequence at the 

25 3' end of the mRNA can be affected by several factors: 
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(i) The degree of complementarity between the SD sequence and 3' end of 
the 16S rRNA. 

(ii) The spacing and possibly the RNA sequence lying between the SD 
sequence and the AUG (Roberts, et al, Proc. Natl. Acad. Sci. USA, 

5 26:760, 1979a; Roberts, et al., Proc. Natl. Acad. Sci. USA, 26:5596, 

1979b; Guarente, et al., Science, 2Q_2:1428, 1980; and Guarente, et al., 
Cell, 2Q:543, 1980). Optimization is achieved by measuring the level of 
expression of genes in plasmids in which this spacing is systematically 
altered. Comparison of different mRNAs shows that there are 
1 o statistically preferred sequences from positions -20 to +1 3 (where the A 

of the AUG is position 0) (Gold, et al., Annu. Rev. Microbiol., 21:365, 
1981). Leader sequences have been shown to influence translation 
dramatically (Roberts, et al., 1979 a, b supra). 

(iii) The nucleotide sequence following the AUG, which affects ribosome 
1 5 binding (Taniguchi, et al., J. Mol. Biol., J±g:533, 1978). 

The 3' control sequences define at least one termination (stop) codon in frame with and 
operatively linked to the heterologous fusion polypeptide. 

In preferred embodiments, the vector utilized includes a prokaryotic origin of replication 
or replicon, Le., a DNA sequence having the ability to direct autonomous replication and 

20 maintenance of the recombinant DNA molecule extra-chromosomally in a prokaryotic 
host cell, such as a bacterial host cell, transformed therewith. Such origins of replication 
are well known in the art. Preferred origins of replication are those that are efficient in 
the host organism. A preferred host cell is E. coli. For use of a vector in E. coli, a 
preferred origin of replication is ColEl found in pBR322 and a variety of other common 

25 plasmids. Also preferred is the pl5A origin of replication found on pACYC and its 
derivatives. The ColEl and pi 5A replicon have been extensively utilized in molecular 
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biology are available on a variety of plasmids and are described at least by Sambrook, 
et al. Molecular Cloning: a Laboratory Manual, 2nd edition, Cold Spring Harbor 
Laboratory Press, 1989). 

The ColEl and P 15A replieons are particularly preferred for use in the present invention 
5 because they each have the ability to direct the replication of plasmid in K coli while the 
other replicon is present in a second plasmid in the same E. coli cell. In other words, 
ColEl and P l 5 A are non-interfering replieons that allow the maintenance of two plasmids 
in the same host (see, for example, Sambrook, et al, supra, at pages 1.3-1.4). 

In addition, those embodiments that include a prokaryotic replicon also include a gene 
1 0 whose expression confers a selective advantage, such as drug resistance, to a bacterial 
host transformed therewith. Typical bacterial drug resistance genes are those that confer 
resistance to ampicillin, tetracycline, neomycin/kanamycin or cholamphenicol. Vectors 
typically also contain convenient restriction sites for insertion of translatable DNA 
sequences. Exemplary vectors are the plasmids P UC8, P UC9, P BR322, and P BR329 
15 available from BioRad Laboratories, (Richmond, CA) and pPL and P KK223 available 
from Pharmacia, (Piscataway, NJ) and pBS (Stratagene, La Jolla, CA). 

The vector comprises a first cassette that includes upstream and downstream translatable 
DNA sequences operatively linked via a sequence of nucleotides adapted for directional 
ligation to an insert DNA. The upstream translatable sequence encodes the secretin 
20 signal as denned herein. The downstream translatable sequence encodes the filamentous 
phage membrane anchor as defined herein. The cassette preferably includes DNA 
expression control sequences for expressing the zinc finger-derived polypeptide that is 
produced whena^inserttranslatableDN^^ 

into the cassette via the sequence of nucleotides adapted for directional ligation. The 
25 filamentous phage membrane anchor is preferably a domain of the cplll orcpVIII coat 
protein capable of binding the matrix of a filamentous phage particle, thereby 
incorporating the fusion polypeptide onto the phage surface. 
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The zinc finger derived polypeptide expression vector also contains a second cassette for 
expressing a second receptor polypeptide. The second cassette includes a second 
translatable DNA sequence that encodes a secretion signal, as defined herein, operatively 
linked at its 3' terminus via a sequence of nucleotides adapted for directional ligation to 
5 a downstream DNA sequence of the vector that typically defines at least one stop codon 
in the reading frame of the cassette. The second translatable DNA sequence is operatively 
linked at its 5' terminus to DNA expression control sequences forming the 5' elements. 
The second cassette is capable, upon insertion of a translatable DNA sequence (insert 
DNA), of expressing the second fusion polypeptide comprising a receptor of the secretion 
1 0 signal with a polypeptide coded by the insert DNA. . For purposes of this invention, the 
second cassette sequences have been deleted. 

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting 
between different genetic environments another nucleic acid to which it has been 
operatively linked. Preferred vectors are those capable of autonomous replication and 
1 5 expression of structural gene products present in the DNA segments to which they are 
operatively linked. Vectors, therefore, preferably contain the replicons and selectable 
markers described earlier. 

As used herein with regard to DNA sequences or segments, the phrase "operatively 
linked" means the sequences or segments have been covalently joined, preferably by 

20 conventional phosphodiester bonds, into one strand of DNA, whether in single or double 
stranded form. The choice of vector to which transcription unit or a cassette of this 
invention is operatively linked depends directly, as is well known in the art, on the 
functional properties desired, e.g., vector replication and protein expression, and the host 
cell to be transformed, these being limitations inherent in the art of constructing 

25 recombinant DNA molecules. 

A sequence of nucleotides adapted for directional ligation, Le., a polylinker, is a region 
of the DNA expression vector that (1) operatively links for replication and transport the 
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upstream and downstream translatable DNA sequences and (2) provides a site or means 
for directional ligation of a DNA sequence into the vector. Typically, a directional 
polylinker is a sequence of nucleotides that defines two or more restriction endonuclease 
recognitionsequence.orrestrictionsite, Upon restriction cleavage, the two sites yield 
5 cohesive termini to which a translatable DNA sequence can be ligated to the DNA 
expression vector. Preferably, the two restriction sites provide, upon restriction cleavage, 
cohesive termini that are non-complementary and thereby permit directional insertion of 
a translatable DNA sequence into the cassette. In one embodiment, the directtonal 
ligation means is provided by nucleotides present in the upstream translatable DNA 
1 0 sequence, downstream translatable DNA sequence, or both. In another embodrment, the 
sequence of nucleotides adapted for directional ligation comprises a sequence of 
nucleotides that defines multiple directional cloning means. Where the sequence of 
nucleotides adapted for directional ligation defines numerous restriction sites, it is 
referred to as a multiple cloning site. 

15 In a preferred embodiment, a DNA expression vector is designed for convenient 
m ani P ulation in the form of a filamentous phage particle encapsulating DNA encoding 
the zinc finger-nucleotide binding polypeptides of the present invention. In tins 
embodiment, aDNAexpression vector further contains a nucleotide sequence that defines 
a filamentous phage origin of replication such that the vector, upon presentaUon of the 
20 appropriate genetic complementation, can replicate as a filamentous phage in smgle 
stranded replicative form and be packaged into filamentous phage particles. This feature 
provides the ability of toe DNA expression vector to be packaged into phage particles for 
subsequent segregation of the particle, and vector contained therein, away from other 
particles that comprise a population of phage particles using screening techmque well 
25 known in the art. 

A filamentous phage origin of replication is a region of the phage genome, as is well 
known, that defines sites for initiation of replication, termination of replicate and 
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packaging of the replicative form produced by replication (see, for example, Rasched, et 
al, Microbiol Rev., 5Q:40 1-427, 1986; and Horiuchi, J. Mol Biol, 188:215-223, 1986). 

A preferred filamentous phage origin of replication for use in the present invention is an 
M13, fl or fd phage origin of replication (Short, et al (Nucl Acids Res., 16:7583-7600, 
5 1 988). Preferred DNA expression vectors are the expression vectors modified pCOMB3 
and specifically pCOMB3.5. 

The production of a DNA sequence encoding a zinc finger-nucleotide binding polypeptide 
can be accomplished by oligonucleotide(s) which are primers for amplification of the 
genomic polynucleotide encoding an zinc finger-nucleotide binding polypeptide. These 

1 0 unique oligonucleotide primers can be produced based upon identification of the flanking 
regions contiguous with the polynucleotide encoding the zinc finger-nucleotide binding 
polypeptide. These oligonucleotide primers comprise sequences which are capable of 
hybridizing with the flanking nucleotide sequence encoding a zinc finger-nucleotide 
binding polypeptide and sequences complementary thereto and can be used to introduce 

1 5 point mutations into the amplification products. 

The primers of the invention include oligonucleotides of sufficient length and appropriate 
sequence so as to provide specific initiation of polymerization on a significant number of 
nucleic acids in the polynucleotide encoding the zinc finger-nucleotide binding 
polypeptide. Specifically, the term "primer" as used herein refers to a sequence compris- 

20 ing two or more deoxyribonucleotides or ribonucleotides, preferably more than three, 
which sequence is capable of initiating synthesis of a primer extension product, which is 
substantially complementary to a zinc finger-nucleotide binding protein strand, but can 
also introduce mutations into the amplification products at selected residue sites. 
Experimental conditions conducive to synthesis include the presence of nucleoside 

25 triphosphates and an agent for polymerization and extension, such as DNA polymerase, 
and a suitable buffer, temperature and pH. The primer is preferably single stranded for 
maximum efficiency in amplification, but may be double stranded. If double stranded, 
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the primer is first treated to separate the two strands before being used to prepare 
extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer 
must be sufficiently long to prime the synthesis of extension products in the presence of 
the inducing agent for polymerization and extension of the nucleotides. The exact length 
5 of primer will depend on many factors, including temperature, buffer, and nucleotide 
composition. The oligonucleotide primer typically contains 1 5-22 or more nucleotides, 
although it may contain fewer nucleotides. Alternatively, as is well known in the art, the 
mixture of nucleoside triphosphates can be biased to influence the formation of mutations 
to obtain a library of cDNAs encoding putative zinc fmger-nucleotide binding 
1 0 polypeptides that can be screened in a functional assay for binding to a zinc fmger- 
nucleotide binding motif, such as one in a promoter in which the binding inhibits 
transcriptional activation. 

Primers of the invention are designed to be "substantially" complementary to a segment 
of each strand of polynucleotide encoding the zinc fmger-nucleotide binding protein to 

15 be amplified. This means that the primers must be sufficiently complementary to 
hybridize with their respective strands under conditions which allow the agent for 
polymerization and nucleotide extension to act. In other words, the primers should have 
sufficient complementarity with the flanking sequences to hybridize therewith and permit 
amplification of the polynucleotide encoding the zinc fmger-nucleotide binding protein. 

20 Preferably, the primers have exact complementarity with the flanking sequence strand. 

Oligonucleotide primers of the invention are employed in the amplification process which 
is an enzymatic chain reaction that produces exponential quantities of polynucleotide 
encoding the zinc finger-nucleotide binding polypeptide relative to the number of reaction 
steps involved. Typically, one primer is complementary to the negative (-) strand of the 
25 polynucleotide encoding the zinc finger-nucleotide binding protein and the other is 
complementary to the positive ( + ) strand. Annealing the primers to denatured nucle.c 
acid followed by extension with an enzyme, such as the large fragment of DNA 
Polymerase I (Klenow) and nucleotides, results in newly synthesized (+) and (-) strands 
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containing the zinc finger-nucleotide binding protein sequence. Because these newly 
synthesized sequences are also templates, repeated cycles of denaturing, primer annealing, 
and extension results in exponential production of the sequence (i.e., the zinc fmger- 
nucleotide binding protein polynucleotide sequence) defined by the primer. The product 
5 of the chain reaction is a discrete nucleic acid duplex with termini corresponding to the 
ends of the specific primers employed. Those of skill in the art will know of other 
amplification methodologies which can also be utilized to increase the copy number of 
target nucleic acid. These may include for example, ligation activated transcription 
(LAT), ligase chain reaction (LCR), and strand displacement activation (SDA), although 
1 0 PCR is the preferred method. 

The oligonucleotide primers of the invention may be prepared using any suitable method, 
such as conventional phosphotriester and phosphodiester methods or automated 
embodiments thereof. In one such automated embodiment, diethylphosphoramidites are 
used as starting materials and may be synthesized as described by Beaucage, et al 
15 {Tetrahedron Letters, 22:1859-1862, 1981). One method for synthesizing 
oligonucleotides on a modified solid support is described in U.S. Patent No. 4,458,066. 
One method of amplification which can be used according to this invention is the 
polymerase chain reaction (PCR) described in U.S. Patent Nos. 4,683,202 and 4,683 ? 195. 

Methods for utilizing filamentous phage libraries to obtain mutations of peptide 
20 sequences are disclosed in U. S. Patent 5,223,409 to Ladner et al., which is incorporated 
by reference herein in its entirety. 

In one embodiment of the invention, randomized nucleotide substitutions can be 
performed on the DNA encoding one or more fingers of a known zinc finger protein to 
obtain a derived polypeptide that modifies gene expression upon binding to a site on the 
25 DNA containing the gene, such as a transcriptional control element. In addition to 
modifications in the amino acids making up the zinc finger, the zinc finger derived 
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20 an^untofcomplcmcn^sonndmaynotbeVnown fme^ ^ 

„ diagnostic applications, so that the amount o O-F maMr , 

complementary arand camto, be detenniued ^^J^l***™™ 
how^^amonntof^uddedwf,!^^^ 

25 mKmieofeompUcated^hainnuctaeacdauunds. Alarg 
to improve the efficiency of the process. 
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The deoxyribonucleotide triphosphates dATP, dCTP, dGTP, and dTTP are added to the 
synthesis mixture, either separately or together with the primers, in adequate amounts and 
the resulting solution is heated to about 90°-100°C from about 1 to 10 minutes, preferably 
from 1 to 4 minutes. After this heating period, the solution is allowed to cool to a 
5 temperature that is preferable for the primer hybridization. To the cooled mixture is 
added an appropriate agent for effecting the primer extension reaction (called herein 
"agent for polymerization"), and the reaction is allowed to occur under conditions known 
in the art. The agent for polymerization may also be added together with the other 
reagents if it is heat stable. This synthesis (or amplification) reaction may occur at room 
1 0 temperature up to a temperature above which the agent for polymerization no longer 
functions. Most conveniently the reaction occurs at room temperature. 

The agent for polymerization may be any compound or system which will function to 
accomplish the synthesis of primer extension products, including enzymes. Suitable 
enzymes for this purpose include, for example, E. coli DNA polymerase I, Klenow 

15 fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available DNA 
polymerases, polymerase muteins, reverse transcriptase, and other enzymes, including 
heat-stable enzymes (i.e., those enzymes which perform primer extension after being 
subjected to temperatures sufficiently elevated to cause denaturation). Suitable enzymes 
will facilitate combination of the nucleotides ii; the proper manner to form the primer 

20 extension products which are complementary to each zinc finger-nucleotide binding 
protein nucleic acid strand. Generally, the synthesis will be initiated at the 3' end of each 
primer and proceed in the 5' direction along the template strand, until synthesis 
terminates, producing molecules of different lengths. There may be agents for 
polymerization, however, which initiate synthesis at the 5 r end and proceed in the other 

25 direction, using the same process as described above. 

The newly synthesized zinc finger-nucleotide binding polypeptide strand and its 
complementary nucleic acid strand will form a double-stranded molecule under 
hybridizing conditions described above and this hybrid is used in subsequent steps of the 
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process. In the next srep, rhe newry syndresired doubie-srranded ■** '»*-' 
L denanrring conditions using any of the procedures described above to prov.de s,n 8 .e- 
stranded molecules. 

n. above process is repented on the srng.e-snnnded molecu.es Additional agen, for 
5 ^erizaL, nuclides, and primers may be added, if necessa^, fo, .be reaetror , . 
Itinndertireconditionsprescribedabove. Again, tire stasis * be — 
„e end of each oftire oligonucleotide primers and vriU proceed along *= srngle nan 
0 f tire renrpiare ,o produce additiona, nucleic acid. After .bis s«p, ha* ofti* — 
pro duc, 1 eonsis, of tire specific nuclerc acid sequence bounded by tire rwo prrmers. 

10 Tbe steps of denaruring and extensron produc, syntiresis can be repeared as often - 

ex ,en. necess^y foe demotion. The anroun, of .be specrfrc nuclerc acd sequence 
produced will occumulare in an exponential fashion. 

Sequencesamp,ifiedbyti*meu,odsofthe invention eon be fin^r evalua«d, derecred 
1S Coned, scouenced, and tire eitirer in solution or after binding ,o a sohd supply 
M y merhod usually applied .o tire detection of a specrfrc DNA sequence such as PC*. 

,9 3), oiigonucleotide .igation assays (OLAs) (Undegren, e, a,.. Serene., 241. ^ 
20 g an tire Itire. Molecular techniques foe DKA -** have been revrewed 

H JIZ allow nrodifieation. such as substitution, of nudeorrdea such that 
urtique zinc fingers ace produced (See Examples fo, fimher detail). 

25 ,„ tire present mvenrion, tire ztirc finger-nudeotide binding polypeptide encoding 
nucleotide sequences may be inserted into a recombinant expression vecror. The rem, 
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"recombinant expression vector" refers to a plasmid, virus or other vehicle known in the 
art that has been manipulated by insertion or incorporation of zinc finger derived- 
nucleotide binding protein genetic sequences. Such expression vectors contain a 
promotor sequence which facilitates the efficient transcription of the inserted genetic 
5 sequence in the host. The expression vector typically contains an origin of replication, 
a promoter, as well as specific genes which allow phenotypic selection of the transformed 
cells. Vectors suitable for use in the present invention include, but are not limited to the 
T7-based expression vector for expression in bacteria (Rosenberg, et al t Gene 56:125, 
1987), the pMSXND expression vector for expression in mammalian cells (Lee and 
1 0 Nathans, J. Biol. Chem. 263 :3521 , 1 988) and baculovirus-derived vectors for expression 
in insect cells. The DNA segment can be present in the vector operably linked to 
regulatory elements, for example, a promoter {e.g., T7, metallothionein 1, or polyhedrin 
promoters). 

DNA sequences encoding novel zinc finger-nucleotide binding polypeptides of the 
1 5 invention can be expressed in vitro by DNA transfer into a suitable host cell. "Host cells" 
are cells in which a vector can be propagated and its DNA expressed. The term also 
includes any progeny of the subject host cell. It is understood that all progeny may not 
be identical to the parental cell since there may be mutations that occur during replication. 
However, such progeny are included when the term "host cell" is used. Methods of stable 
20 transfer, in other words when the foreign DNA is continuously maintained in the host, are 
known in the art. 

Transformation of a host cell with recombinant DNA may be carried out by conventional 
techniques as are well known to those skilled in the art. Where the host is prokaryotic, 
such as E. colU competent cells which are capable of DNA uptake can be prepared from 
25 cells harvested after exponential growth phase and subsequently treated by the CaCl 2 
method by procedures well known in the art. Alternatively, MgCl 2 or RbCl can be used. 
Transformation can also be performed after forming a protoplast of the host cell or by 
electroporation. 



PCT/US95/00829 

WO 95/19431 

-34- 

"ta.es convention, mechanic*, process such as on, 
used. 

v.** utilized to express the zinc finger 
* A variety of host-expression vector systems may be utilized to e P 

5 "nuclide codins seance. These but are « J— • 

aLorganisnts such as bacteria —a - raccnbinan, ™' 
rid DNA or cosmid DNA expressioa vectors containing a anc finger 

, o yeas, cession vectors — the ainc finger—. ^-*"« — 
mos aic virus, CaMV; robacco mosaic vnus, TMV) o, 

20 plvide far —a, - P—— ">°™ * *" "* 8 " 
mammalian, iasea, yeast or plaot expression systems. 

a .nvnfaniimberof suitable transcription 
rwndina on the host/vector system utilized, any of a number 

ptrp ptac (ptrp-lac hybrid promoter) and the like may 

ptrp, pui f genome of mammalian cells (e.g., 

mammalian cell systems, promoters derived from the genome 
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metallothionein promoter) or from mammalian viruses {e.g., the retrovirus long terminal 
repeat; the adenovirus late promoter; the vaccinia virus 7.5K promoter) may be used. 
Promoters produced by recombinant DNA or synthetic techniques may also be used to 
provide for transcription of the inserted zinc finger-nucleotide binding polypeptide coding 
5 sequence. 

In bacterial systems a number of expression vectors may be advantageously selected 
depending upon the use intended for the zinc finger derived nucleotide-binding 
polypeptide expressed. For example, when large quantities are to be produced, vectors 
which direct the expression of high levels of fusion protein products that are readily 
purified may be desirable. Those which are engineered to contain a cleavage site to aid 
in recovering the protein are preferred. Such vectors include but are not limited to the E. 
coli expression vector pUR278 (Ruther, et aL y EMBO J., 2:1791, 1 983), in which the zinc 
finger-nucleotide binding protein coding sequence may be ligated into the vector in frame 
with the lac Z coding region so that a hybrid zinc finger-lac Z protein is produced; pFN 
vectors (Inouye & Inouye, Nucleic Acids Res. 12:3101-3109, 1985; Van Heeke & 
Schuster, J. Biol. Chem. 264:5503-5509, 1989); and the like. 

In yeast, a number of vectors containing constitutive or inducible promoters may be used. 
For a review see, Current Protocols in Molecular Biology, Vol. 2, 1988, Ed. Ausubel, 
et al, Greene Publish. Assoc. & Wiley Interscience, Ch. 13; Grant, et a/., 1987, 
20 Expression and Secretion Vectors for Yeast, in Methods in Enzymology, Eds. Wu & 
Grossman, 3 1987, Acad. Press, N.Y., Vol. 153, pp.516-544; Glover, 1986, DNA Cloning, 
Vol. II, IRL Press, Wash., D.C., Ch. 3; and Bitter, 1987, Heterologous Gene Expression 
in Yeast, Methods in Enzymology, Eds. Berger & Kimmel, Acad. Press, N.Y., Vol. 152, 
pp. 673-684; and The Molecular Biology of the Yeast Saccharomyces, 1982, Eds. 
25 Strathern et al., Cold Spring Harbor Press, Vols. I and II. A constitutive yeast promoter 
such as ADH or LEU2 or an inducible promoter such as GAL may be used (Cloning in 
Yeast, Ch. 3, R. Rothstein In: DNA Cloning Vol.11, A Practical Approach, Ed. DM 
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Ciover 1986, IRL Press, Wash., D.C.). Ahen»veiy, vec,ors may be used which 
promote integral of foreign DN A m— «" *• ^as. chromosome. 

In « where plan, expression vaciora are used, the expression of a zinc fmger. 

leoride binding peptide coding seance may be driven by any of a number of 
nucleotide unoing y v promoters 
5 promoter, For example, viral promoters such as the 35S RNA and P ^ 

, x, . 1 1 n< 1 1-514 1984), or the coat protein promoter to 
0 fCaMV(Brisson,effl/.,Na/«rc,21fl.5n-514, iviwj, p 

TMV crJL-, - ^O,, *307-31 i, . W -» - -* — ' £ 
p r omoerssuchasmesm a l 1 suhuni t ofRUB I SCO<Con^,e,a ( .,«,0, 1; ,67 1 ..6^ 

These consul con be mhoduced h«o p.an, ce„s using Ti piaamids, Pu piasmrd , piam 
voters diree, DNA nansformarion, microinjection, eiec.ropor.tton, etc. For 
vrrus vectors, drrec. wdssb ach & Weissbach, Me.tais/»r /"tor 

Molecular Biology, Academic Press, NY, Secoon vm, pp 
15 & Corny, Plan, Molecular Biology, 2d Ed., Blackie, London, Ch. 7-9, 

A,*— expression system to. can be used, o express a prorein of me invenrion is 

Ta^V) susedasavocorroexpaessforeigngenes. The vims 
Lei cei,, The zinc <mger-nuc.eo.idc hinding pOypepude coding se,uence « 
Clod in.0 non-essonua, regions ( * — » f 
L of me vtus and piaced under conrroi of an AoNPV promo.er (for exampie .he 
Sedrm promorer, Success™, inserfon of mo zinc fmger-nucieo., de bmdm 
poiypepride coding s=,ue„ce wU. rosuU in inacrivarion of me poiyhedrm gene -d 
poiypepuuc B ^rnciackine the proteinaceous coat 

pre^onofne^iudedaecumoh^^fae^^^ 

c wi edforbytepoiyhednngene).Thesereoombuw..v m sesa K *en 
ta which the insened gene is expressed. (Eg., see Smith, . A * « 
Smith, U.S. Patent No. 4,215,051). 
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Eukaryotic systems, and preferably mammalian expression systems, allow for proper 
post-translational modifications of expressed mammalian proteins to occur. Therefore, 
eukaryotic cells, such as mammalian cells that possess the cellular machinery for proper 
processing of the primary transcript, glycosylation, phosphorylation, and, advantageously 
5 secretion of the gene product, are the preferred host cells for the expression of a zinc 
finger derived-nucleotide binding polypeptide. Such host cell lines may include but are 
not limited to CHO, VERO, BHK, HeLa, COS, MDCK, -293, and WI38. 

Mammalian cell systems that utilize recombinant viruses or viral elements to direct 
expression may be engineered. For example, when using adenovirus expression vectors, 

1 0 the coding sequence of a zinc finger derived polypeptide may be ligated to an adenovirus 
transcription/translation control complex, e.g., the late promoter and tripartite leader 
sequence. This chimeric gene may then be inserted into the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential region of the viral genome 
(e.g., region El or E3) will result in a recombinant virus that is viable and capable of 

1 5 expressing the zinc finger polypeptide in infected hosts (e.g., see Logan & Shenk, Proc. 
Natl. Acad. Sci. USA £1:3655-3659, 1984). Alternatively, the vaccinia virus 7.5K 
promoter may be used, (e.g., see, Mackett, et al, Proc. Natl. Acad. Sci. USA, 22:7415- 
7419, 1982; Mackett, et a!., J. Virol. 42:857-864, 1984; Panicali, et al, Proc. Natl. Acad. 
Sci. USA , 22:4927-4931, 1982). Of particular interest are vectors based on bovine 

20 papilloma virus which have the ability to replicate as extrachromosomal elements (Sarver, 
et al, Mol Cell Biol. 1:486, 1981). Shortly after entry of this DNA into mouse cells, the 
plasmid replicates to about 100 to 200 copies per cell. Transcription of the inserted 
cDNA does not require integration of the plasmid into the host's chromosome, thereby- 
yielding a high level of expression. These vectors can be used for stable expression by 

25 including a selectable marker in the plasmid, such as the neo gene. Alternatively, the 
retroviral genome can be modified for use as a vector capable of introducing and directing 
the expression of the zinc finger-nucleotide binding protein gene in host cells (Cone & 
Mulligan, iVoc. Natl. Acad. Sci. USA 81:6349-6353, 1984). High level expression may 
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also be achieved using inducibie promoters, including, bu, no, limired ro, .he 
metnlloihionine II A promoter and heat shock promoters. 

For long-term, high-yield production of recombinant proteins, stable expression is 
preferred. Rather tan using expression vectors which contain visa, origins of rephcauon, 
5 host ce„s can be ^formed wi* me a oDNA controued by appropriate express 
conuo. elements (e.g., promoter, enhancer, sequence* transcription ,e— , 
po.yadeny.auon sites, etc.), and a select marker. The se,ecb,e market m me 

te p,asm,d into men chromosomes and grow ,„ form foci which in mm can be doned 
n0 and expanded into ceU lines. For example, following the introduction 

engineered ceUs may be ai.owed.ogmw for ! -2 daysman enriched medra, an hen.e 
s^hedtoasOectivemedia. A number of se,ecdon systems may be use ,mcudmgbn« 
no. limited to me herpea simp.ex virus thymidine kinase (Wigler, of., CeU JUft 
.977) hvpoxanthine-guanine phosphoribosy.unnsferase (Szytalska * Sayba.sk, Proc. 
,8 m Acai. SC. U&LttHK l%2),andadenmephosphoribo^.transferase(Lowy « 
" at. Cell 22:817, .980) genes, which can be employed in tk, hgprr or aprr cells 
respectively. Also, antimetabolite resistance-conferring genes can be used as the basts 
of selection; for examp,e, th. genes for dhfr, which confers resistance '° — ' 
(Wigler, a, al, Noll. Ac**- SC. USA .77:3567, .980; O'Hare, el at, Proc. « - * 
20 W. , 22:1527, ,98,,; gP, which confers resistance to mycopheno.ic acid (MuUtgan * 
Berg 7>roc mi. Acad. SC. USA, 21:2072, .98.; neo, which ennfers resrsrance ,0 the 
Jnogiycosnie <M.8 (Co^-Garapin, « »,., , W «. !»•• «'* - *■» 

nddiriona, ae.ecmHe genes have been described, name.y trpB, which aHows ee. m 
25 utffize indo.e in p.aee of tryptophan; hisD, which ,,ows ce.,s to utihze htsttno, m place 
tf^W***^***-* USA, O0M. .9 8); and ODC 
ethane decarboxy.ase) which confers resisrance to me ornithine de— 
,„hibi,or. 2-(dinuotomemy.)-DL.omidnne, DFMO (McCoruogue L •» Current 
Co^tm/cntmns m Uoleclor Siology. Co.d Spring Harbor Uboratory ed., 1987). 
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Isolation and purification of microbially expressed protein, or fragments thereof provided 
by the invention, may be carried out by conventional means including preparative 
chromatography and immunological separations involving monoclonal or polyclonal 
antibodies. Antibodies provided in the present invention are immunoreactive with the 
5 zinc finger-nucleotide binding protein of the invention. Antibody which consists 
essentially of pooled monoclonal antibodies with different epitopic specificities, as well 
as distinct monoclonal antibody preparations are provided. Monoclonal antibodies are 
made from antigen containing fragments of the protein by methods well known in the art 
(Kohler, et aL Nature, 256:495, 1 975; Current Protocols in Molecular Biology, Ausubel, 
10 etal.,ed., 1989). 

The present invention also provides gene therapy for the treatment of cell proliferative 
disorders which are associated with a cellular nucleotide sequence containing a zinc 
finger-nucleotide binding motif Such therapy would achieve its therapeutic effect by 
introduction of the zinc finger-nucleotide binding polypeptide polynucleotide, into cells 
1 5 of animals having the proliferative disorder. Delivery of a polynucleotide encoding a zinc 
finger-nucleotide binding protein can be achieved using a recombinant expression vector 
such as a chimeric virus or a colloidal dispersion system, for example. 

The term M cell-proliferative disorder" denotes malignant as well as non-malignant cell 
populations which morphologically often appear to differ from the surrounding tissue. 

20 The cell-proliferative disorder may be a transcriptional disorder which results in an 
increase or a decrease in gene expression level. The cause of the disorder may be of 
cellular origin or viral origin. Gene therapy using a zinc finger-nucleotide binding 
polypeptide can be used to treat a virus-induced cell proliferative disorder in a human, for 
example, as well as in a plant. Treatment can be prophylactic in order to make a plant 

25 cell, for example, resistant to a virus, or therapeutic, in order to ameliorate an established 
infection in a cell, by preventing production of viral products. 
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A polynucleotide encoding the zinc finger-nucleotide binding polypeptide is useful m 
treating malignancies of the various organ systems, such as, for example, lung, breast, 
lymphoid, gastrointestinal, and genito-urinary tract as well as adenocarcinomas which 
include malignancies such as most colon cancers, renal-cell carcinoma, prostate cancer, 
5 non-small cell carcinoma of the lung, cancer of the small intestine, and cancer of the 
esophagus A polynucleotide encoding the zinc finger-nucleotide binding polypeptide » 
also useful in trearing non-malignant cell-proliferative diseases such as psoras, 
pemphigus vulgaris, Behcet's syndrome, and lipid histiocytosis. Essentially, any disorder 
which is etiologically linked to the activation of a zinc finger-nucleotide binding mouf 
10 containing promoter, structural gene, or RNA, would be considered susceptible to 
treatment with a polynucleotide encoding a derivative or variant zinc finger denved- 
nucleotide binding polypeptide. 

Various viral vectors that can be utilized for gene therapy as taught herein include 
adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. 
15 Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples 
of retroviral vectors in which a single foreign gene can be inserted include, but are not 
toited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus 
(HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). 
A number of additional retroviral vectors can incorporate multiple genes. All of these 
20 vectorscantransferormc^ 

can be identified and generated. By inserting a zinc finger derived-DNA bmdmg 
polypeptide sequence of interest into the viral vector, along with another gene that 
encodes the ligand for a receptor on a specific target cell, for example, the vector is made 
target specific. Retroviral vectors can be made target specific by inserting, for example, 
25 a polynucleotide encoding a protein. Preferred targeting is accomplished by usmg an 
antibody to target the retroviral vector. Those of skill in the art will know of, or can 
readily ascertain without undue experimentation, specific polynucleotide sequences which 
can be inserted into the retroviral genome to allow target specific delivery of the retroviral 
vector containing the zinc finger-nucleotide binding protein polynucleotide. 
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Since recombinant retroviruses are defective, they require assistance in order to produce 
infectious vector particles. This assistance can be provided, for example, by using helper 
cell lines that contain plasmids encoding all of the structural genes of the retrovirus under 
the control of regulatory sequences within the LTR. These plasmids are missing a 

5 nucleotide sequence which enables the packaging mechanism to recognize an RNA 
transcript for encapsitation. Helper cell lines which have deletions of the packaging 
signal include but are not limited to Y2, PA317 and PA 12, for example. These cell lines 
produce empty virions, since no genome is packaged. If a retroviral vector is introduced 
into such cells in which the packaging signal is intact, but the structural genes are 

10 replaced by other genes of interest, the vector can be packaged and vector virion 
produced. The vector virions produced by this method can then be used to infect a tissue 
cell line, such as NIH 3T3 cells, to produce large quantities of chimeric retroviral virions. 

Another targeted delivery system for polynucleotides encoding zinc finger deri ved-DNA 
binding polypeptides is a colloidal dispersion system. Colloidal dispersion systems 

1 5 include macromolecule complexes, nanocapsules, microspheres, beads, and lipid-based 
systems including oil-in-water emulsions, micelles, mixed micelles, and liposomes. The 
preferred colloidal system of this invention is a liposome. Liposomes are artificial 
membrane vesicles which are useful as delivery vehicles in vitro and in vivo. It has been 
shown that large unilamellar vesicles (LUV), which range in size from 0.2-4.0 urn can 

20 encapsulate a substantial percentage of an aqueous buffer containing large 
macromolecules. RNA, DNA and intact virions can be encapsulated within the aqueous 
interior and be delivered to cells in a biologically active form (Fraley, et al, Trends 
Biochem. Set, 6:77, 1981). In addition to mammalian cells, liposomes have been used 
for delivery of polynucleotides in plant, yeast and bacterial cells. In order for a liposome 

25 to be an efficient gene transfer vehicle, the following characteristics should be present: 
(1) encapsulation of the genes of interest at high efficiency while not compromising their 
biological activity; (2) preferential and substantial binding to a target cell in comparison 
to non-target cells; (3) delivery of the aqueous contents of the vesicle to the target cell 
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cytoplasm at high efficiency; and (4) accurate and effective expression of genetic 
information (Mannino, et al, Biotechniques, 6:682, 1988). 

The composition of the liposome is usually a combination of phospholipids, particularly 
high-phase-transition-temperature phospholipids, usually in combination with sterols, 
5 especially cholesterol. Other phospholipids or other lipids may also be used. Tne 
physical characteristics of liposomes depend on P H, ionic strength, and the presence of 
divalent cations. 

Examples of lipids useful in liposome production include phosphatidyl compounds, such 
as phosphatidylglycerol, phosphatidylcholine, phosphatidyl, phosphatidyletha- 
oolamine, sphingoids, eerebrosides, and gangliosides. Particularly useful are 
diacylphosphatidylglycerols, where the lipid moiety contains from .4-18 carta atoms, 
particularly from ,6-18 carbon atoms, and is saturated. Hlushauve phospholipids tnclude 
egg phosphatidylcholine, dipalmitoylphosphatidyleholine and 
distearoylphosphatidylcholine. 

The targeting of liposomes has been classified baaed on anatomical and mechanistic 
factors. Anatomical classification is based on -he level of selectivity, for example, organ- 
specific, cell-specific, and organelle-speciftc. Mechanist targeting can be disttngutshed 
teed upon whether iris passive or active. Passive targeting unites the natural tendency 
of liposomes to distribute to cells of the reticulo-endothelial system (RES) in organs 
which contain sinusoidal capillaries. Active targeting, on the other hand, involves 
alteration of the liposome by coupling the liposome to a specific ligand such aa a 
nronoclonal antihody.sugar, glytapid, or protein, or by changing the composition tor *ze 
of the liposome in order to achieve targeting to organs and cel. types other than the 
naturally occurring sites of localization. 

The surface of the targeted delivery system may be modified in a variety of ways. In the 
case of a liposomal targeted delivery system, lipid groups can be incorporated mto the 
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lipid bilayer of the liposome in order to maintain the targeting ligand in stable association 
with the liposomal bilayer. Various linking groups can be used for joining the lipid 
chains to the targeting ligand. 

In general, the compounds bound to the surface of the targeted delivery system will be 
5 ligands and receptors which will allow the targeted delivery system to find and "home in" 
on the desired cells. A ligand may be any compound of interest which will bind to 
another compound, such as a receptor. 

In general, surface membrane proteins which bind to specific effector molecules are 
referred to as receptors. In the present invention, antibodies are preferred receptors. 
Antibodies can be used to target liposomes to specific cell-surface ligands. For example, 
certain antigens expressed specifically on tumor cells, referred to as tumor-associated 
antigens (TAAs), may be exploited for the purpose of targeting antibody-zinc finger- 
nucleotide binding protein-containing liposomes directly to the malignant tumor. Since 
the zinc fmger-nucleotide binding protein gene product may be indiscriminate with 
respect to cell type in its action, a targeted delivery system offers a significant 
improvement over randomly injecting non-specific liposomes. A number of procedures 
can be used to covalently attach either polyclonal or monoclonal antibodies to a liposome 
bilayer. Antibody-targeted liposomes can include monoclonal or polyclonal antibodies 
or fragments thereof such as Fab, or F(ab') 2 , as long as they bind efficiently to an the 
antigenic epitope on the target cells. Liposomes may also be targeted to cells expressing 
receptors for hormones or other serum factors. 

In another embodiment, the invention provides a method for obtaining an isolated zinc 
fmger-nucleotide binding polypeptide variant which binds to a cellular nucleotide 
sequence comprising, first, identifying the amino acids in a zinc finger-nucleotide binding 
25 polypeptide that bind to a first cellular nucleotide sequence and modulate the function of 
the nucleotide sequence. Second, an expression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified in the first step is 
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created. Third, the library is expressed in a suitable host cell, which will be apparent to 
those of skill in the ait, and finally, a clone is isolated that produces a polypeptide variant 
that binds to a second cellular nucleotide sequence and modulates the function of the 
second nucleotide sequence. The invention also includes a zinc finger-nucleotide binding 
5 polypeptide variant produced by the method described above. 

Preferably, a phage surface expression system, as described in the Examples of the 
present disclosure, is utilized as the library. The phage library is treated with a reducing 
reagent, such as dithiothreitol, which allows proper folding of the expression product on 
the phage surface. The library is made from polynucleotide sequences which encode a 
10 zinc finger-nucleotide binding polypeptide variant and which have been randomized, 
preferably by PCR using primers containing degenerate triplet codons at sequence 
locations corresponding to the determined amino acids in the first step of the method. 
The degenerate triplet codons have the formula NNS or NNK, wherein S is either G or 
C, K is either G or T, and N is independently selected from the group consisting of A, C, 
15 G, or T. 

The modulation of the function of the cellular nucleotide sequence includes the 
enhancement or suppression of transcription of a gene operatively linked to the cellular 
nucleotide sequence, particularly when the nucleotide sequence is a promoter. The 
modulation also includes suppression of transcription of a nucleotide sequence which is 
20 within a structural gene or a virus DNA or RNA sequence. Modulation also includes 
inhibition of translation of a messenger RNA. 

In addition, the invention discloses a method of treating a cell proliferative disorder, by 
the ex vivo introduction of a recombinant expression vector comprising the polynucleotide 
encoding a zinc finger-nucleotide binding polypeptide into a cell to modulate in a cell the 
25 function of a nucleotide sequence comprising a zinc finger-nucleotide binding motif. The 
cell proliferative disorder comprises those disorders as described above which are 
typically associated with transcription of a gene at reduced or increased levels. The 
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method of the invention offers a technique for modulating such gene expression, whether 
at the promoter, structural gene, or RNA level. The method includes the removal of a 
tissue sample from a subject with the disorder, isolating hematopoietic or other cells from 
the tissue sample, and contacting isolated cells with a recombinant expression vector 

5 containing the DNA encoding zinc finger-nucleotide binding protein and, optionally, a 
target specific gene. Optionally, the cells can be treated with a growth factor, such as 
interleukin-2 for example, to stimulate cell growth, before reintroducing the cells into the 
subject. When reintroduced, the cells will specifically target the cell population from 
which they were originally isolated. In this way, the trans-repressing activity of the zinc 

1 0 finger-nucleotide binding polypeptide may be used to inhibit or suppress undesirable cell 
proliferation in a subject. In certain cases, modulation of the nucleotide sequence in a cell 
refers to suppression or enhancement of the transcription of a gene operatively linked to 
a cellular nucleotide sequence. Preferably, the subject is a human. 

An alternative use for recombinant retroviral vectors comprises the introduction of 
1 5 polynucleotide sequences into the host by means of skin transplants of cells containing 
the virus. Long term expression of foreign genes in implants, using cells of fibroblast 
origin, may be achieved if a strong housekeeping gene promoter is used to drive 
transcription. For example, the dihydrofolate reductase (DHFR) gene promoter may be 
used. Cells such as fibroblasts, can be infected with virions containing a retroviral 
20 construct containing the gene of interest, for example a truncated and/or mutagenized zinc 
finger-nucleotide binding protein, together with a gene which allows for specific 
targeting, such as tumor-associated antigen (TAA), and a strong promoter. The infected 
cells can be embedded in a collagen matrix which can be grafted into the connective 
tissue of the dermis in the recipient subject. As the retrovirus proliferates and escapes the 
25 matrix it will specifically infect the target cell population. In this way the transplantation 
results in increased amounts of trans-repressing zinc finger-nucleotide binding 
polypeptide being produced in cells manifesting the cell proliferative disorder. 
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The novel zinc finger-nucleotide binding proteins of >he invention, whieh modnlate 
rranscriptional activation ot uanslation eidte, at <he promoter, stntcnnal gene, or RN A 
Uvel couid be used in piantspec.es as weli. Transgenic plants wonid be prodnced sueh 
tot fc plant is resistant to particular bacrerial or vW pathogens, for exampie. Medrods 
5 fort^femngandexpr.ssingnndeicacidsinplan^weHknovvnindr.an. (Seefor 

example, Hiatt, e, al, U.S. Parent No. 5,202,422, incorporated herein by reference.) 

,„ a further embodinten, the invention provides a method for identifying a modulating 

finger-nucleotide binding motif of mtercs, comprising incubating components, 
composing a nuclide sequence encoding the pu,ative modulating protein operably 
link ed ro a firs, inducible promoter and a reporter gene operab.y finked ,0 a second 
inducible promoter and a inofinger-nuclcofide binding mofif, wherein me inoubatmgts 
earned on, under conditions sufficient to allow me component ,o interne, and measunng 
rhe effect of Ore pmative modulating pmtein on the expression of the reporter gene. 

The tern, -modulating- envisions the inhibition o, suppression of expression from a 
promoter containing a zinc finger-nucleotide binding motif when k is over-acfivated or 
augmentation or enhancement of expression fiom such a promoter when it b under- 
activated. A Brat inducible promoter, sueh as me arabinose promorer, is operably hnke 
„ me nucleotide sequence encoding the putative modulating po.ypepfide. A second 
inducible promoter, such as the lactose promoter, is opembly linked to a zinc finger 
derived-DNA binding motif followed by a reporter gene, such as p-gaiac.os.dase. 
tacubafionofteeom^n^ybem^v,/^^^^™^ 1 "^ 
prokaryodc or eukaryodc systems, such as £«,« or COS cefis, respeefively. Condmons 
whieh aHowmeassay^proee^meludeincubafion in the presence ofasubaance, sueh 
as arabinose and .actose, which activate the first and second inducible promoters, 
respectively, thereby allowing expression of tire nucleotide sequence encodmg tire 
putative pans-modulating protein nucleotide sequence. Whether or no, tire putative 
modulating protein binds ,o the zinc finger-nucleotide binding motif which is operably 
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linked to the second inducible promoter, and affects its activity is measured by the 
expression of the reporter gene. For example, if the reporter gene was p-galactosidase, 
the presence of blue or white plaques would indicate whether the putative modulating 
protein enhances or inhibits, respectively, gene expression from the promoter. Other 
commonly used assays to assess the function from a promoter, including chloramphenicol 
acetyl transferase (CAT) assay, will be known to those of skill in the art. Both prokaryote 
and eukaryote systems can be utilized. 



The invention is useful for the identification of a novel zinc finger-nucleotide binding 
polypeptide derivative or variant and the nucleotide sequence encoding the polypeptide. 
1 0 The method entails modification of the fingers of a wild type zinc finger protein so that 
they recognize a nucleotide, either DNA or RNA, sequence other than the sequence 
originally recognized by that protein. For example, it may be desirable to modify a 
known zinc finger protein to produce a new zinc finger-nucleotide binding polypeptide 
that recognizes, binds to, and inactivates the promoter region (LTR) of human immunode- 
15 ficiency virus (HIV). Following identification of the protein, a truncated form of the 
protein is produced that represses transcription normally activated from that site. In HTV, 
the target site for a zinc finger-nucleotide binding motif within the promoter is CTG- 
TTG-TGT. The three fingers of zif268, for example, are mutagenized, as described in the 
examples. The fingers are mutagenized independently on the same protein (one by one), 
20 or independently or "piecewise" on three different zif268 molecules and religated after 
being mutagenized. Although one of these two methods is preferable, an alternative 
method would allow the three fingers to be mutagenized simultaneously. After 
mutagenesis, a phage display library is constructed and screened with the appropriate 
oligonucleotides which include the binding site of interest. If the fingers were 
25 mutagenized independently on the same protein, sequential libraries are constructed and 
panning performed after each library construction. For example, in zif268, a finger 3 
library is constructed and panned with a finger 3 specific oligo; the positive clones from 
this screen are collected and utilized to make a finger 2 library (using finger 3 library 
DNA as a template); panning is performed with a finger 32 specific oligo; DNA is 
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as 

zinc- 



collected from positive clones and used as a template for finger 1 library construction; 
finally selection for a protein with 3 new fingers is performed with a finger 321 specific 
oligo. The method results in identification of a new zinc finger derived-DNA binding 
protein that recognizes, binds to, and represses transcription from the HIV promoter. 
Subsequent truncation, mutation, or expansion of various fingers of the new protein 
would result in a protein which represses transcription from the HIV promoter. 

The invention provides, in EXAMPLES 7-13, an illustration of modification of Zif268 
: described above. Therefore, in another embodiment, the invention provides a novel 
c-finger-nucleotide binding polypeptide variant comprising at least two zinc finger 
1 0 modules that bind to an HTV sequence and modulates the function of the HIV sequence, 
for example, the HIV promoter sequence. 

The identification of novel zinc finger-nucleotide binding proteins allows modulation of 
gene expression from promoters to which these proteins bind. For example, when a cell 
proliferative disorder is associated with overactivation of a promoter which contains a 

15 zinc finger-nucleotide binding motif, such suppressive reagents as antisense 
polvnucleotide sequence or binding antibody can be introduced to a cell, as an alternative 
to the addition of a zinc finger-nucleotide binding protein derivative. Alternatively, when 
a cell proliferative disorder is associated with underactivation of the promoter, a sense 
polynucleotide sequence (the DNA coding strand) or zinc finger-nucleotide binding 

20 polypeptide can be introduced into the cell. 

Minor modifications of the primary amino acid sequence may result in proteins which 
have substantially equivalent activity compared to the zinc finger derived-binding protein 
described herein. Such modifications may be deliberate, as by site-directed mutagenesis, 
or may be spontaneous. All proteins produced by these modifications are included herein 
25 as long as zinc finger-nucleotide binding protein activity exists. 
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In another embodiment, zinc finger proteins of the invention can be manipulated to 
recognize and bind to extended target sequences. For example, zinc finger proteins 
containing from about 2 to 20 zinc fingers Zif(2) to Zif(20), and preferably from about 
2 to 12 zinc fingers, may be fused to the leucine zipper domains of the Jun/Fos proteins, 
5 prototypical members of the bZIP family of proteins (O'Shea, et al t Science, 254:539, 
1991). Alternatively, zinc finger proteins can be fused to other proteins which are capable 
of forming heterodimers and contain dimerization domains. Such proteins will be known 
to those of skill in the art. 

The Jun/Fos leucine zippers are described for illustrative purposes and preferentially form 

1 0 heterodimers and allow for the recognition of 12 to 72 base pairs. Henceforth, Jun/Fos 
refer to the leucine zipper domains of these proteins. Zinc finger proteins are fused to 
Jun, and independently to Fos by methods commonly used in the art to link proteins. 
Following purification, the Zif-Jun and Zif Fos constructs (SEQ ID NOS: 33, 34 and 35, 
36 respectively), the proteins are mixed to spontaneously form a Zif-Jun/Zif-Fos 

1 5 heterodimer. Alternatively, coexpression of the genes encoding these proteins results in 
the formation of Zif-Jun/Zif-Fos heterodimers in vivo. Fusion of the heterodimer with an 
N-terminal nuclear localization signal allows for targeting of expression to the nucleus 
(Calderon, et al, Cell, 41:499, 1982). Activation domains may also be incorporated into 
one or each of the leucine zipper fusion constructs to produce activators of transcription 

20 (Sadowski,^/., G^,il8:137, 1992). These dimeric constructs then allow for specific 
activation or repression of transcription. These heterodimeric Zif constructs are 
advantageous since they allow for recognition of palindromic sequences (if the fingers on 
both Jun and Fos recognize the same DNA/RNA sequence) or extended asymmetric 
sequences (if the fingers on Jun and Fos recognize different DNA/RNA sequences). For 

25 example the palindromic sequence 



5' - GGC CCA CGC |n! GCG TGG GCG - 3' 

3 ' - GCG GGT GCG W x CGC ACC CGC - 5 ' (SEQ ID NO: 37) 
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is recognized by the Zif268-Fos/Zif268 Jun dimer (x is any number). The spacing 
between subsites is determined by the site of fusion of Zif with the Jun or Fos zipper 
domains and the length of the linker between the Zif and zipper domains. Subsite spacing 
is determined by a binding site selection method as is common to those skilled in the art 
5 (TWesen, et al, Nucleic Acids Research, 1&3203, 1990). Example of the recognition of 
an extended asymmetric sequence is shown by Zif(C7) 6 -Jun/Zif-268-Fos dimer. This 
protein consists of 6 fingers of the C7 type (EXAMPLE 1 1) linked to Jun and three 
fingers of Zif268 linked to Fos, and recognizes the extended sequence: 

5 . - CGC CGC CGC CGC CGC CGC {N} GCG TGG GCG - 3 ' 
10 3 . . GCG GCG GCG GCG GCG GCG W x CGC ACC CGC - 5 

(SEQIDNO:38) 



Oxidative or hydrolytic cleavage of DNA or RNA with metal chelate complexes can be 
performed by methods known to those skilled in the art. In another embodiment, 
attachment of chelating groups to Zif proteins is preferably facilitated by the 
15 incorporation of a Cysteine (Cys) residue between the initial Methionine (Met) and the 
first Tyrosine (Tyr) of the protein. The Cys is then alkylated with chelators known to 
those skilled in the art, for example, EDTA derivatives as described (Sigman, 
Biochemistry, 22:9097, 1990). Alternatively the sequence Gly-Gly-His can be made as 
the most amino terminal residues since an amino terminus composed of the residues has 
been described to chelate Cu+2 (Mack, et al, J. Am. Chew. Soc, ilfl:7572, 1988). 
Preferred metal ions include Cu + 2, Ce+3 (Takasaki and Chin, J. Am. Chem. Soc, 
116:1121, 1994) Zn+2, Cd+2, Pb+2, Fe+2 (Schnaith, et al, Proc. Natl. Acad. Sci., USA, 
21:569, 1994),Fe+3,Ni+2,Ni+3,La+3,Eu+3 (Hall, etal, Chemistry and Biology, 1:185, 
1994), Gd+3, Tb+3, Lu + 3 Mn+2, Mg+2. Cleavage with chelated metals is generally 
25 performed in the presence of oxidizing agents such as 0 2 , hydrogen peroxide HA and 
reducing agents such as thiols and ascorbate. The site and strand (+ or - site) of cleavage 
is determined empirically (Mack, et al, J. Am. Chem. Soc, JJ0:7572, 1988) and » 
dependent on the position of the Cys between the Met and the Tyr preceding the first 
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finger. In the protein Met (AA) Tyr-(Zif)i-]2> the chelate becomes Met-(AA) xl Cys- 
Chelate-(AA) x2 -Tyr-(Zif),., 2 , where AA = any amino acid and x = the number of amino 
acids. Dimeric zif constructs of the type Zif-Jun/Zif-Fos are preferred for cleavage at two 
sites within the target oligonucleotide or at a single long target site. In the case where 
5 double stranded cleavage is desired, both Jun and Fos containing proteins are labelled 
with chelators and cleavage is performed by methods known to those skilled in the art. 
In this case, a staggered double-stranded cut analogous to that produced by restriction 
enzymes is generated. 

Following mutagenesis and selection of variants of the Zif268 protein in which the finger 
1 0 1 specificity or affinity is modified, proteins carrying multiple copies of the finger may 
be constructed using the TGEKP linker sequence by methods known in the art. For 
example, the C7 finger may be constructed according to the scheme: 

MKLLEPYACPVESCDRRFSKSADLKRHIRHTGEKP- 

(YACPVESCDRRFSKSADLKHIRIHIQEKE) ,.„, (SEQ ID NO: 39) where the sequence 
15 of the last linker is subject to change since it is at the terminus and not involved in linking 
two fingers together. This protein binds the designed target sequence GCG-GCG-GCG 
(SEQ ID NO: 32) in the oligonucleotide hairpin CCT-CGC-CGC-CGC-GGG-TTT-TCC- 
CGC-GCC-CCC GAG G (SEQ ID NO: 40) with an affinity of 9nM, as compared to an 
affinity of 300 nM for an oligonucleotide encoding the GCG-TGG-GCG sequence (as 
20 determined by surface plasmon resonance studies). Fingers utilized need not be identical 
and may be mixed and matched to produce proteins which recognize a desired target 
sequence. These may also be utilized with leucine zippers (e.g., Fos/Jun) or other 
heterodimers to produce proteins with extended sequence recognition. 

In addition to producing polymers of finger 1, the entire three finger Zif268 and modified 
25 versions therein may be fused using the consensus linker TGEKP to produce proteins with 
extended recognition sites. For example, the protein Zif268-Zif268 can be produced in 
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whic h *. nam., prorein has been fused ,0 itself using the TGEKP linker. This protein 
now hinds the sentence GCG-TGG-GCG-GCG-TGG.GCG. Therefore mod,f.ca«ons. 
^n .he three fingers of Z,Bo8 or other zinc flnger proteins known in the ar. may be 
teedrogetherroformaprorein which .cognizes extended sequences. These newzmc 
6 proteins tnay also be used in combination with leucine zippers tf deaued. 

^.various changes and modifications can he made wiftout departing from the s P tn, 
or scope of the invention. 
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A recombinant polypeptide containing three of nine of the TFIIIA zinc fingers (Clemens, 
et ai, Proc. Nat'l Acad Set, USA, 32:10822, 1992) has been generated by polymerase 
chain reaction (PCR) amplification from the cDNA for TFIIIA and expression in E. coli. 
5 The recombinant protein, termed zfl -3, was purified by ion exchange chromatography 
and its binding site within the 5S gene was determined by a combination of DNase 1 
footprinting and binding to synthetic oligonucleotides (Liao, et al t J. Mol Biol., 223:857, 
1992). The examples provide experiments which show that the binding of this 
polypeptide to its recognition sequence placed close to an active RNA polymerase 

10 promoter could inhibit the activity of that promoter in vitro. To provide such a test 
system, a 26 bp oligonucleotide containing the 13 bp recognition sequence for zfl -3 was 
cloned into the polylinker region of plasmid pUC19 near the promoter sequence for T7 
RNA polymerase. The DNA binding activity of our preparation of recombinant zfl -3 was 
determined by gel mobility shift analysis with the oligonucleotide containing the binding 

1 5 site. In addition, in vitro transcription was performed with T7 RNA polymerase in the 
presence or absence of the same amounts of the zfl -3 polypeptide used in the DNA 
binding titration. For each DNA molecule bound by zfl -3, that DNA molecule is 
rendered inactive in transcription. In these examples, therefore, a zinc finger polypeptide 
has been produced which fully blocked the activity of a promoter by binding to a nearby 

20 target sequence. 
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FX AMPLE 1 
SFginrN-rr-SPECIFIC GENE TAB£Eim£ 
pv 7Tisr EfflGEB proteins 

A From the crystal structure of zif268, it is clear that specific histidine (non-zinc 
coordinating his residues) and arginine residues on the surface of the a-helix, the finger 
tip and at helix positions 2, 3, and 6 (immediately preceding the conserved htsUdme) 
participate in hydrogen bonding to DNA guanines. As the number of structures of zinc 
finger complexes continues to increase, it will be likely that different amino adds and 
different positions may participate in base specific recognition. FIGURE 2 (panel A) 
shows the sequence of the three ammo-tenninal fmgers of TFIH A with basic amino actds 
at these positions underlined. Similar to finger 2 of the regulatory protein zif268 (Krox- 
20) and fingers 1 and 3 of Spl, finger 2 ofTFUIA contains histidine and arginine rescues 
at these DNA contact positions; further, each of these zinc fingers minimally recognizes 
the sequence GGG (FIGURE 2, panel B) within the 5S gene promoter. 

Arecombmampo^ 

by polymerase chain reaction (PGR) amplification from the cDNA for TFIIIA and 
expression in E. coli (Clemens, et al, supra). An experiment was des lg ned to determme 
whether the binding of this polypeptide to its recognition sequence, placed close to an 
active RNA polymerase promoter, would inhibit the activity of that promoter in *m. 
The following experiments were done to provide such a test system. A 23 bp 
oligonucleotide (Liao,,r a /., 1992, .^containing the 13 bp recogrution sequence o^ 
rf.Swas cloned into the polylinker region ofplasmidpBluescriptSK^Stratagene.La 
Jolla, C A), near the promoter sequence for T7 RNA polymerase. The parent plasmid was 
digested with the reaction enzyme EcoRW and, after dephosphorylation with calf 
intestmalalkalmephosphatase^ephosphoryl^ 

ligation with T4 DNA ligase. The ligation product was used for transformation ofDH5o 
E. coli cells. Clones harboring 23 bp inserts were identified by restriction digests of 
tniniprep DNA. The success of cloning was also verified by DNA sequence analyst, 
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The DNA binding activity of the preparation of recombinant zfl -3 was also determined 
by gel mobility shift analysis with a 56 bp radiolabeled EcoRI/XhoI restriction fragment 
derived from the cone containing the binding site for zfl -3 and with the radiolabeled 23 
bp oligonucleotide. Gel shift assays were done as described (Liao, et al, supra; Fried, et 

5 al, Nucl. Acids., Res., 9_:6505, 1981). The result of the latter analysis is shown in 
FIGURE 3. Binding reactions (20 ^1) also contained 1 tig of unlabeled plasmid DNA 
harboring the same 23 bp sequence. In lanes 2-12, the indicated amounts of zfl -3 were 
also included in the reactions. After incubation at ambient temperature for 30 min, the 
samples were subjected to electrophoresis on a 6% nondenaturing polyacrylamide gel in 

1 0 88mM Tris-borate, pH 8.3, buffer. In each reaction, a trace amount of the radiolabeled 
oligonucleotide was used with a constant amount (1 ^g) of plasmid DNA harboring the 
zfl -3 binding site. The reactions of lanes 2-12 contained increasing amounts of the zfl -3 
polypeptide. The autoradiogram of the gel is shown. The results indicate that binding of 
zfl -3 to the radiolabeled DNA caused a retardation of electrophoretic mobility. The 

1 5 percentage of radiolabeled DNA molecules bound by zfl -3 also reflects the percentage 
of unlabeled plasmid DNA molecules bound. 

In vitro transcription experiments were performed with T7 RNA polymerase in the 
presence or absence of the same amounts of the zfl -3 polypeptide used in the DNA 
binding titration with identical amounts of the plasmid DNA harboring the zfl -3 binding 

20 site. Each reaction contained, in a volume of 25 ^1, 1 Mg of PvwII-digested pBluescript 
SK+DNA containing the 23 bp binding site for zfl -3 inserted in the EcoRV site of the 
vector, 40 units of RNasin, 0.6 mM ATP+UTP+CTP, 20 vM GTP and 10 M Ci of o- 32 P- 
GTP and 10 units of T7 RNA polymerase (Stratagene). The reaction buffer was provided 
by Stratagene. After incubation at 37°C for 1 hour, the products of transcription were 

25 purified by phenol extraction, concentrated by ethanol precipitation and analyzed on a 
denaturing polyacrylamide gel. T7 transcription was monitored by the incorporation of 
radioactive nucleotides into a run-off transcript. FIGURE 4 shows an autoradiogram of 
a denaturing polyacrylamide gel analysis of the transcription products obtained. In this 
experiment, the plasmid DNA was cleaved with the restriction enzyme PvuW and the 
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expected length of the run-off transcript was 245 bases. Addition of zfl-3 polypeptide 
to the reaction repressed transcription by T7 RNA polymerase. 

FIGURE 5 shows a graph in which the percentage of DNA molecules bound by zfl-3 in 
the DNA gel mobility shift assay (x-axis) versus the percentage of inhibition of T7 RNA 
polymerase transcription by the same amounts of zfl-3 (y-axis) has been plotted. Note 
thateach data point corresponds to identical amounts of zfl-3 used in the two assays. The 
one-to-one correspondence of the two data sets is unequivocal. T7 transcription was 
monitored by the incorporation of radioactive nucleotides into a run-off transcnpt. 
Transcription was quantitated by gel electrophoresis, autoradiography and densitometry. 
Gel mobility shift assays were quantitated in a similar fashion. For each DNA molecule 
bound by zfl-3, that DNA molecule is rendered inactive in transcription. In this 
experiment, therefore, a zinc finger polypeptide has fully blocked the activity of a 
promoter by binding to a nearby target sequence. 

B Since the previous experiment was performed with a prokaryotic RNA polymerase, 
the following experiment was performed to determine whether the zinc finger polypept.de 
zfl-3 could also block the activity of a eukaryotic RNA polymerase. To test this, a 
transcription extract prepared from unfertilized Xenopus eggs (Hartl, et al, J. Cell Biol, 
12Q/613, 1993) and the Xenopus 5S RNA gen, template was used. These extracts are 
highly active in transcription of 5S RNA and tRNAs by RNA polymerase III. As a test 
template, the 5S RNA gene which naturally contains the binding sites for TFIIIA and zfl - 
3 was used. Each reaction contained 10 »\ of a high speed supernatant of the egg 
h'omogenate, 9 ng of TFIIIA, nucleoside triphosphates (ATP, UTP, CTP) at 0.6 mM and 
1 0„Ci of a-*P-GTP and GTP at 20 ,M in a 25 „1 reaction. All reactions contamed 1 80 
ng of a plasmid DNA harboring a single copy of the Xenopus somatic-type 5S RNA gene, 
and the reactions of lanes 2 and 3 also contained 300 ng of a Xenopus tRNAmet gene- 
containing plasmid. Prior to addition of the Xenopus egg extract and TFIIIA, 0.2 and 0.4 
„g of zfl-3 were added to the reactions of lanes 2 and 3, respectively. The amount of zfl- 
3 used in the experiment of lane 2 was sufficient to bind all of the 5S gene-contammg 
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DNA in a separate binding reaction. After a 1 5 min. incubation to allow binding of zfl-3 
to its recognition sequence, the other reaction components were added. After a 2 hour 
incubation, the products of transcription were purified by phenol extraction, concentrated 
by ethanol precipitation and analyzed on a denaturing polyacrylamide gel. The 

5 autoradiogram is shown in FIGURE 6. FIGURE 6 also shows the result of a controlled 
reaction in which no zinc finger protein was added (lane 1). As a control, lanes 2 and 3 
also contained a tRNA gene template, which lacks the binding site for TFIIIA and zfl-3. 
5S RNA transcription was repressed by zfl-3 while tRNA transcription was unaffected. 
These results demonstrate that zfl-3 blocks the assembly of a eukaryotic RNA 

10 polymerase III transcription complex and shows that this effect is specific for DNA 
molecules that harbor the binding site for the recombinant zinc finger protein derived 
from TFIIIA. 

Three-dimensional solution structures have been determined for a protein containing the 
first three zinc fingers of TFIIIA using 2D, 3D, and 4D NMR methods. For this purpose, 

15 the protein was expressed and purified from E. coli and uniformly labeled with 13 C and 
13 N. The NMR structure shows that the individual zinc fingers fold into the canonical 
finger structure with a small p-sheet packed against an a-helix. The fingers are not 
entirely independent in solution but there is evidence of subtle interactions between them. 
Using similar techniques the 3D structure of a complex between zf 1-3 and a 13bp 

20 oligonucleotide corresponding to its specific binding site on the 5S RNA gene is 
determined and used to provide essential information on the molecular basis for sequence- 
specific nucleotide recognition by the TFIIIA zinc fingers. This information is in turn 
used in designing new zinc finger derived-nucleotide binding proteins for regulating the 
preselected target genes. Similar NMR methods can be applied to determine the detailed 

25 structures of the complexes formed between designed zinc finger proteins and their target 
genes as part of a structure-based approach to refine target gene selectivity and enhance 
binding affinity. 
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ln ori e, ,0 rap* son iaage nbmties o f * finger 
5 system UMr sloped for antibody M. (Barbas, « A M7B0BS W W W 
rid To this end, pCo m b3 has been modified foe *, fnger setectton^ Tbe 

v«,oe nCnmb3 5 The zif268 three finger protein has been tnod,fied by PCR and 

. 5 . * - « — - *• ph - : 

J! floa Site-ditected .uresis has been performed to inset an W « 
bleenfmgers , and 2 in order to facility library cot—. Fonhermore, asf2 8 
beween fingers l , pcaamtideta , whicha llo»a its binding to be conveniently 

r^^t-r^.^utcreatefing^^^^ 

« Ute amino tetmma, aide of the . he., mvoived in recognition ™~ 
HNKdophtgs^egytopmvide degeneracy. Wsd^ finger ongu^y^^GCG 
^snbsitl Selection for binding ,o an AAA snbsite reveaied a eonsensns panent 
appearing in the selected sequences. 
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The zif268 containing plasmid, pZif89 (Pavletich, et al, Science 252:809, 1991), was 
used as the source of zif268 DNA for modification of the zinc fingers. Briefly, pZif89 
was cloned into the plasmid, pComb3.5, after amplification by PCR using the following 
primers: 

ZF: 5'-ATG AAA CTG CTC GAG CCC TAT GCT TGC CCT GTC GAG-3' 
(SEQUENCE ID NO. 2) 

ZR: 5'-GAG GAG GAG GAG ACT AGT GTC CTT CTG TCT TAA ATG GAT TIT 
GGT-3' (SEQUENCE ID NO. 3). 

The PCR reaction was performed in a 100^1 reaction containing 1 n% of each of 
oligonucleotide primers ZF and ZR, dNTPs (dATP, dCTP, dGTP, dTTP), 1.5mM MgCla 
Taq polymerase (5 units) 10 ng template pZif89, and 10^1 10 x PCR buffer (Perkin - 
Elmer Corp.). Thirty rounds of PCR amplification in a Perkin - Elmer Cetus 9600 Gene 
Amp PCR system thermocycler were performed. The amplification cycle consisted of 
denaturing at 94°C for one minute, annealing at 54°C for one minute, followed by 
extension at 72°C for two minutes. The resultant PCR amplification products were gel 
purified as described below and digested with XhoVSpel and ligated into pComb3.5. 
pComb3.5 is a variant of pComb3 (Barbas, et al, Proc. Natl. Acad. ScL, USA, 8S-7978, 
1991) which has the light chain region, includ ; ig its lacZ promoter, removed. Briefly, 
pComb3 was digested with Nhel, klenow treated, digested with Xbal, and religated to 
form pComb3.5. Other similar vectors which could be used in place of pComb3.5, such 
as Surf Zap™ (Stratagene, La Jolla, CA), will be known to those of skill in the art. 

The phagemid pComb3.5 containing zif268 was then used in PCR amplifications as 
described herein to introduce nucleotide substitutions into the zinc fingers of zif268, to 
produce novel zinc fingers which bind to specific recognition sequences and which 
25 enhance or repress transcription after binding to a given promoter sequence. 
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The methods of producing nove! zinc fingers with particular sequence recognition 
specifrciry and regulation of gene expression capabilities involved the following steps: 

,. A first zinc finger (e.g., Zinc finger 3 of ziQM) was firs, randomized through me 
use of overlap PCR; 

5 2. Amplification products from the overlap PCR containing randomized zinc fingers 
were ligated back into P Comb3.5 to form a randomized library; 

3 Following expression of bacteriophage coat protein Ill-anchored zinc finger from 
the library, the surface protein expressing phage were panned against specific zmc 
finger recognition sequences, resulting in the selection of several specific 

1 0 randomized zinc fingers; and 

4 Following selection of sequence-specific zinc fingers, the corresponding phagemids 
were sequenced and the amino acid residue sequence was derived therefrom. 

F.XAMPLE3 

gREPAB ^TT™ QE-BAJSBQM1ZED 71HC FINGERS 



15 



20 



Torandomizemezbcfingersofz^ 

amplifications were performed for each finger as described herein, followed by a third 
overiapPCRamphfi^ 

products, followed by a third amplification. The nucleotide sequence of zinc finger of 
^68oftemplate P Comb3^ 

4 The nucleotide positions that were randomized in zinc finger 3 began at nucleot.de 
position217andendedatposition237,exc,udingserine. The template zif268 sequence 
atu.tspecifiedsiteencodedeighttotalammoacidresiduesinfingerS. Thisammoac.d 
residue sequence of finger 3 in P Comb3.5 which was to be modified is Mg-Ser-A.p-Qiu- 
Aj£-Ly§-Arg-His (SEQUENCE ID NO. 5). The underlined amino acids represent those 
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residues which were randomized. 

A pool of oligonucleotides which included degenerate oligonucleotide primers, 
designated BZF3 and ZF36K and non-degenerate primers R3B and FTX3 having the 
nucleotide formula described below, (synthesized by Operon Technologies, Alameda, 
5 CA), were vised for randomizing the zinc finger 3 of zi£268 in pComb3.5. The six triplet 
codons for introducing randomized nucleotides included the repeating sequence NNM 
(complement of NNK), where M can be either G or C and N can be A, C, G or T. 

The first PCR amplification resulted in the amplification of the 5' region of the zinc finger 
3 fragment in the pComb3.5 phagemid vector clone. To amplify this region, the 

10 following primer pairs were used. The 5' oligonucleotide primer, FTX3, having the 
nucleotide sequence 5'-GCA ATT AAC CCT CAC TAA AGG G-3' (SEQUENCE ID 
NO. 6), hybridized to the noncoding strand of finger 3 corresponding to the region 5' 
(including the vector sequence) of and including the first two nucleotides of zif268. The 
y oligonucleotide primer, BZF3, having the nucleotide sequence 5 r -GGC AAA CTT CCT 

1 5 CCC ACA AAT-3' (SEQUENCE ID NO. 7) hybridized to the coding strand of the finger 
3 beginning at nucleotide 216 and ending at nucleotide 196. 

The PCR reaction was performed in a 100 microliter (ul) reaction containing one 
microgram (ug) of each of oligonucleotide primers FTX3 and BZF3, 200 millimolar 
(mM) dNTP's (dATP, dCTP, dGTP, dTTP), 1.5 mM MgCl 2 Taq polymerase (5 units) 

20 (Perkin-Elmer Corp., Norwalk, CT), 10 nanograms (ng) of template pComb3.5 zif268, 
and 10 ul of 10X PCR buffer purchased commercially (Perkin-Elmer Corp.). Thirty 
rounds of PCR amplification in a Perkin-Elmer Cetus 9600 GeneAmp PCR System 
thermocycler were then performed. The amplification cycle consisted of denaturing at 
94°C for 30 seconds, annealing at 50°C for 30 seconds, followed by extension at 72°C 

25 for one minute. To obtain sufficient quantities of amplification product, 30 identical PCR 
reactions were performed. 
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The result PCR amplification produce were then ge. purified on = ..5% agarose gei 
mixt% srandard electroelution techniques as described in "Molacubr Cloning: A 
Laboratory Manna!", SambroOK, e, a,., ads.. Co,d Spring Harbor, NV (.989). Bnefly 
after get electrophoresis of the digested PCR amplified zinc finger domain, the regton of 
5 *. ge! containing the DNA fragments of predetermined size was excised, electroeluted 
into a dialysis membrane, ethano, precipitated and resuspended in buffer contanung 10 
mM Tris-HCl, pH 7.5 and 1 mM EDTA to a final concentration of 50 ng/ml. 

The purified resultant PCR amplication products ftom the firs, reaction were then used 
h, an overlap extension PCR reaction win, Ore products of the second PCR reaction, both 
, o as described below, to recombme the two products into reconstructed zi,268 contatmng 
randomized zinc fingers. 

The second PCR reaction resulted in the amplification of the 3' end of zif268 finger 3 
overlapping with tire above products and extending 3' of finger 3. To amplify tins regron 

,, primer pairs were used. The 5' coding oligonucleotide primer pool was deserted 
2F36K and had the nucleotide sequence represented by the formula, 5 -ATT TGT GGG 
AGG AAG TTT CCC NNK ACT NNK NNK NNK NNK NNK CAT ACC AAA ATC 
CAT TTA-3' (SEQUENCE ID NO. 8) (nucleotides 196-255). The V noncoding pnmer, 
R3B hybridized to dte coding strand a. me 3' end of gene 111 (gin) having the sequence 
20 5'-TTG ATA TTC ACA AAC GAA TGG-3' (SEQUENCE ID NO. 9). The regton 
" between tite » specified ends of the primer poo. is represented by a 15-mer NNK 
degeneracy. The second PCR reaction was performed on a second aliquot of P Comb3.5 
template in a 100 ul reaction as described above containing 1 ng of each of 
origonucleotide prime, as described. The resultant PCR products encoded a drverse 
25 population of randomized ziC68 finger 3 regions of 8 amino acid residues in lengtir. Tne 
products were then gel purified as described above. 
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For the annealing reaction of the two PCR amplifications, 1 ng each of gel purified 
products from the first and second PCR reactions were then admixed and fused in the 
absence of primers for 35 cycles of PCR as described above. The resultant fusion product 
was then amplified with I ug each of FTX3 and R3B oligonucleotide primers as a primer 
5 pair in a final PCR reaction to form a complete zif268 fragment by overlap extension. 
The overlap PCR amplification was performed as described for other PCR amplifications 
above. 

To obtain sufficient quantities of amplification product, 30 identical overlap PCR 
reactions were performed. The resulting fragments extended from 5' to 3' and had 
randomized finger 3 encoding 6 amino acid residues. The randomized zif268 
amplification products of approximately 450 base pairs (bp) in length in each of the 30 
reactions were first pooled and then gel purified as described above and cut with Xhol and 
Spel, prior to their relegation into the pComb3.5 surface display phagemid expression 
vector to form a library for subsequent screening against zinc finger recognition sequence 
oligos for selection of a specific zinc finger. The ligation procedure in creating 
expression vector libraries and the subsequent expression of the zi£268 randomized 
pComb3.5 clones was performed as described below in Example 4. 

Nucleotide substitutions may be performed on additional zinc fingers as well. For 
example, in zif268, fingers 1 and 2 may also be modified so that additional binding sites 
20 may be identified. For modification of zinc finger 2, primers FTJC3 (as described above) 
and ZFNsi-B, S'-CAT GCA TAT TCG ACA CTG GAA-3 1 (SEQUENCE ID NO. 10) 
(nucleotides 100-120) are used for the first PCR reaction, and R3B (described above) and 
ZF2r6F (5'-C AG TGT CGA ATA TGC ATG CGT AAC TTC (NNK) 6 ACC ACC CAC 
ATCCGC ACC CAC-3') (SEQUENCE ID NO. 11) (nucleotides 103 to 168) are used for 
25 the second reaction. For modification of finger 1, RTX3 (above) and ZFI6rb (5'-CTG 
GCC TGT GTG GAT GCG GAT ATG (MNN) 5 CGA MNN AGA AAA GCG GCG ATC 
GCA GGA-3') (SEQUENCE ID NO. 12) (nucleotides 28 to 93) are used for the first 
reaction and ZFIF (5'-CAT ATC CGC ATC CAC ACA GGC CAG-3') (SEQUENCE ID 
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NO. 13) (nucleotide 70 to 93) and R3B (above) are used in the second reaction. The 
overlap reaction utilizes FTX3 and R3B as described above for finger 3. Preferably, each 
finger is modified individually and sequentially on one protein molecule, as opposed to 
all three in one reaction. The nucleotide modifications of finger 1 of zif268 would 
5 include the underlined amino acids R S ElE-LXE H, (SEQUENCE ID NO. 14) which 
is encoded by nucleotides 49 to 72. The nucleotide modifications of finger 2 of zif268 
would include S R S D H L (SEQUENCE ID NO. 15), which is encoded by nucleotides 
130 to 147. (See FIGURE 7). 

EXAMPLE 4 

10 PREPARATION OF pH Ar.FMIT)-DISPT AYFD SEQUENCES 

T1AVTNH RANDOMIZED 7,!NC FINGERS 

The phagemid P Comb3.5 containing zif268 sequences is a phagemid expression vector 
that provides for the expression of phage-displayed anchored proteins, as described above. 
The original pComb 3 expression vector was designed to allow for anchoring of expressed 

1 5 antibody proteins on the bacteriophage coat protein 3 for the cloning of combinatorial Fab 
libraries. Xhol and Spel sites were provided for cloning complete PCR-amplified heavy 
chain (Fd) sequences consisting of the region beginning with framework 1 and extending 
through framework 4. Gene III of filamentous phage encodes this 406-residue minor 
phage coat protein, cpHI (cp3), which is expressed prior to extrusion in the phage 

20 assembly process on a bacterial membrane and accumulates on the inner membrane facing 
into the periplasm of E. colt. 

In this system, the first cistron encodes a periplasmic secretion signal (pelB leader) 
operatively linked to the fusion protein, zif268-cpIII. The presence of the pelB leader 
facilitates the secretion of both the fusion protein containing randomized zinc finger from 
25 the bacterial cytoplasm into the periplasmic space. 
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By this process, the zif268-cpIII was delivered to the periplasmic space by the pelB leader 
sequence, which was subsequently cleaved. The randomized zinc finger was anchored 
in the membrane by the cpIII membrane anchor domain. The phagemid vector, 
designated pComb3.5, allowed for surface display of the zinc finger protein. The 
5 presence of the XhoVSpel sites allowed for the insertion of Xhol/Spel digests of the 
randomized zif268 PCR products in the pComb3.5 vector. Thus, the ligation of the 
zif268 mutagenized nucleotide sequence prepared in Example 3 resulted in the in-frame 
ligation of a complete zif268 fragment consisting of PCR amplified finger 3. The cloning 
sites in the pComb3.5 expression vector were compatible with previously reported mouse 
10 and human PCR primers as described by Huse, et al, Science, 24£:1275-1281 (1989)and 
Persson, et al, Proc. Natl. Acad Set, USA, 88_:2432-2436 (1991). The nucleotide 
sequence of the pelB, a leader sequence for directing the expressed protein to the 
periplasmic space, was as reported by Huse, et al, supra. 

The vector also contained a ribosome binding site as described by Shine, et al, Nature, 
254:34, 1975). The sequence of the phagemid vector, pBluescript, which includes ColEl 
and Fl origins and a beta-lactamase gene, has been previously described by Short, et al, 
Nuc. Acids Res., j£:7583-7600, (1988) and has the GenBank Accession Number 52330 
for the complete sequence. Additional restriction sites, Sail, Acc\, Hindi, CM, Hindlll, 
EcoKV, Pstl and Smal, located between theXhol and Spel sites of the empty vector were 
derived from a 51 base pair sniffer fragment of pBluescript as described by Short, et al, 
supra. A nucleotide sequence that encodes a flexible 5 amino acid residue tether 
sequence which lacks an ordered secondary structure was juxtaposed between the Fab and 
cp3 nucleotide domains so that interaction in the expressed fusion protein was minimized. 

Thus, the resultant combinatorial vector, pComb3.5, consisted of a DNA molecule having 
25 a cassette to express a fusion protein, zif268/cp3 . The vector also contained nucleotide 
residue sequences for the following operatively linked elements listed in a 5' to 3' 
direction: the cassette consisting of LacZ promoter/operator sequences; a Notl restriction 
site; a ribosome binding site; a pelB leader; a spacer region; a cloning region bordered by 
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10 



5- Xhol and 3' Spel restriction sites; the tether sequence; and the sequences encoding 
bacteriophage cp3 followed by a stop codon. A Nhel restriction site located between the 
original two cassettes (for heavy and light chains); a second lacZ promoter/operator 
sequence followed by an expression control ribosome binding site; a pelB leader; a spacer 
region; a cloning region bordered by 5' Sad and a 3' Xbal restriction sites followed by 
expression control stop sequences and a second Natl restriction site were deleted from 
P Comb3 to form pComb 3.5. Those of skill in the art will know of similar vectors that 
could be utilize in the method of the invention, such as the Surf Zap™ vector (Stratagene, 
LaJolla, CA.). 

In the above expression vector, the zif268/c P 3 fusion protein is placed under the control 
of a lac promoter/operator sequence and directed to the periplasmic space by pelB leader 
sequencesforfunctional assembly onthe membrane. Inclusion of the phage Fl intergeruc 
region in the vector allowed for the packaging of single-stranded phagemid with the aid 
ofhelper phage. The use of helper phage superinfection allowed for the expression of two 
forms of cp3. Consequently, normal phage morphogenesis was perturbed by competition 
between the Fd7cp3 fusion and the native cp3 of the helper phage for incorporation into 
the virion. The resulting packaged phagemid carried native c P 3, which is necessary for 
infection, and the encoded fusion protein, which is displayed for selection. Fusion with 
the C-terminal domain was necessitated by the phagemid approach because fusion with 
20 the infective N-terminal domain would render the host cell resistant to infection. 

The P Comb3 and 3.5 expression vector described above forms the basic construct of the 
display phagemid expression vector used in this invention for the production of 
randomized zinc finger proteins. 
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F.XAMPLE 5 
E HAGEMffl LIBBABY CONSTRUCTION 

In order to obtain expressed protein representing randomized zinc fingers, phagemid 
libraries were constructed. The libraries provided for surface expression of recombinant 
5 molecules where zinc fingers were randomized as described in Example 3. 

For preparation of phagemid libraries for expressing the PCR products prepared in 
Example 3, the PCR products were first digested with Xhol and Spel and separately 
ligated with a similarly digested original (i.e., not randomized) pComb3.5 phagemid 
expression vector. The Xhol and Spel sites were present in the pComb3.5 vector as 

10 described above. The ligation resulted in operatively linking the zif268 to the vector, 
located 5' to the cp3 gene. Since the amplification products were inserted into the 
template P Comb3.5 expression vector that originally had the heavy chain variable domain 
sequences, only the heavy chain domain cloning site was replaced leaving the rest of the 
pComb3.5 expression vector unchanged. Upon expression from the recombinant clones, 

15 the expressed proteins contained a randomized zinc finger. 

Phagemid libraries for expressing each of the randomized zinc fingers of this invention 
were prepared in the following procedure. To form circularized vectors containing the 
PCR product insert, 640 ng of the digested PCR products were admixed with 2 ug of the 
linearized P Comb3.5 phagemid vector and ligation was allowed to proceed overnight at 

20 room temperature using 10 units of BRL ligase (Gaithersburg, MD) in BRL ligase buffer 
in a reaction volume of 150 ul. Five separate ligation reactions were performed to 
increase the size of the phage library having randomized zinc fingers. Following the 
ligation reactions, the circularized DNA was precipitated at -20°C for 2 hours by the 
admixture of 2 ul of 20 mg/ml glycogen, 15 ul of 3 M sodium acetate at pH 5.2 and 300 

25 ul of ethanol. DNA was then pelleted by microcentrifugation at 4'C for 15 minutes. The 
DNA pellet was washed with cold 70% ethanol and dried under vacuum. The pellet was 
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resuspended in 10 ul of water and transformed by electroporation into 300 ul of E. coli 
XL 1 -Blue cells to form a phage library. 

After transformation, to isolate phage expressing mutagenized finger 3, phage were 

induced as described below for subsequent panning on a hairpin oligo having the 

following sequence (SEQUENCE ID NO. 16): 

NH,-CGT-AAA-2££-S£2-CCC - T 

T 
T 

GCA - TTT - ACC - CGC - GGG - T 
The bold sequence indicates the new zinc finger 3 binding site (formerly GCG), the 
underlined sequence represents the finger 2 site and the double underlining represents the 
finger 1 binding site. 

Transformed E. coli were grown in 3 ml of SOC medium (SOC was prepared by 
admixture of 20 grams (g) bacto-tryptone, 5 g yeast extract and 0.5 g NaCl in 1 liter of 
water, adjusting the pH to 7.5 and admixing 20 ml of glucose just before use to induce the 
expression of the zif268-cpIII), were admixed and the culture was shaken at 220 rpm for 
1 hour at 37°C. Following this incubation, 1 0 ml of SB (SB was prepared by admixing 
30 g tryptone, 20 g yeast extract, and 10 g Mops buffer per liter with pH adjusted to 7) 
containing 20 ug/ml carbenicillin and 10 ug/ml tetracycline were admixed and the 
admixture was shaken at 300 rpm for an additional hour. This resultant admixture was 
admixed to 100 ml SB containing 50 ug/ml carbenicillin and 10 ug/ml tetracycline and 
shaken for 1 hour, after which helper phage VCSM13 (10" pfu) were admixed and the 
admixture was shaken for an additional 2 hours at 37»C. After this time, 70 ug/ml 
kanamycin was admixed and maintained at 30'C overnight. The lower temperature 
resulted in better expression of zif268 on the surface of the phage. The supernatant was 
cleared by centrifugation (4000 rpm for 15 minutes in a JA10 rotor at 4'C). Phage were 
precipitated by admixture of 4% (w/v) polyethylene glycol 8000 and 3% (w/v) NaCl and 
maintained on ice for 30 minutes, followed by centrifugation (9000 rpm for 20 minutes 
i JA10 rotor at 4'C). Phage pellets were resuspended in 2 ml of buffer (5mM DTT, 
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lOmMTris-HCl, pH 7.56, 90mM KC1, 90mM ZnCl 2 , ImM MgCl 2 and microcentrifiiged 
for three minutes to pellet debris, transferred to fresh tubes and stored at -20 °C for 
subsequent screening as described below. DTT was added for refolding of the 
polypeptide on the phage surface. 

5 For determining the titering colony forming units (cfu), phage (packaged phagemid) were 
diluted in SB and 1 ul was used to infect 50 ul of fresh (Aod6oo = 1) ILcoH XLl-Blue cells 
grown in SB containing 10 ug/ml tetracycline. Phage and cells were maintained at room 
temperature for 15 minutes and then directly plated on LB/carbenicillin plates. The 
randomized zinc finger 3 library consisted of 5 x 10 7 PFU total. 

10 Multiple Pannings of the Phage Library 

The phage library was panned against the hairpin oligo containing an altered binding site, 
as described above, on coated microtiter plates to select for novel zinc fingers. 

The panning procedure used, comprised of several rounds of recognition and replication, 
was a modification of that originally described by Parmley and Smith (Parmley, et al., 

15 Gene, 73:305-318, 1988; Barbas, et al, 1991, supra.). Five rounds of panning were 
performed to enrich for sequence-specific binding clones. For this procedure, four wells 
of a microtiter plate (Costar 3690) were coated by drying overnight at 37°C with 1/ig the 
oligo or the oligo was covalently attached to BSA with EDC/NHS activation to coat the 
plate (360 ^g acetylated BSA (Boehringer Manheim), 577 fig oligo, 40mM NHS, and 

20 lOOmM EDC were combined in 1.8 ml total volume and incubated overnight at room 
temperature. The plates were coated using 50/il per plate and incubated at 4°C overnight. 
The wells were washed twice with water and blocked by completely filling the well with 
3% (w/v) BSA in PBS and maintaining the plate at 37°C for one hour. After the blocking 
solution was shaken out, 50 ul of the phage suspension prepared above (typically 10 12 pfu) 

25 were admixed to each well, and the plate was maintained for 2 hours at 37°C. 
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Phage were removed and the plate was washed once with water. Each well was then 
washed 10 times with TBS/Tween (50 mM Tris-HCl at pH 7.5, 150 mM NaCl, 0.5% 
Tween 20) over a period of 1 hour at room temperature where the washing consisted of 
pipetting up and down to wash the well, each time allowing the well to remain completely 

5 filled with TBSAween between washings. The plate was washed once more with 
distilled water and adherent phage were eluted by the addition of 50 ul of elution buffer 
(0 1 M HC1, adjusted to pH 2.2 with solid glycine, containing 1 mg/ml BSA) to each well 
followed by maintenance at room temperature for 10 minutes. The elution buffer was 
pipetted up and down several times, removed, and neutralized with 3 ul of 2 M Tris base 

10 per 50 ul of elution buffer used. 

Eluted phage were used to infect 2 ml of fresh (OD 600 = 1) E. coli XLl-Blue cells for 15 
minutes at room temperature, after which time 10 ml of SB containing 20 ug/ml 
carbenicillin and 10 ug/ml tetracycline was admixed. Aliquots of 20, 10, and 1/10 ul were 
removed from the culture for plating to determine the number of phage (packaged 

15 phagemids) that were eluted from the plate. The culture was shaken for 1 hourat37'C, 
after which it was added to 100 ml ofSB containing 50 ug/ml carbenicillin and lOug/ml 
tetracycline and shaken for 1 hour. Helper phage VCSM13 (10- pfu) were then added 
and the culture was shaken for an additional 2 hours. After this time, 70 ug/ml kanamycm 
was added and the culture was incubated at 37°C overnight. Phage preparation and 

20 further panning were repeated as described above. 

Following each round of panning, the percentage yield of phage were determined, where 
o/o yield = (number of phage eluted/number of phage applied) X 100. The initial phage 
input ratio was determined by titering on selective plates to be approximately 1 0" cfu for 
each round of panning. The final phage output ratio was determined by infecting two ml 
25 of loearithmic phase XLl-Blue cells as described above and plating aliquots on selecuve 
plates. From this procedure, clones were selected from the Fab library for their ability to 
bind to the new binding sequence oligo. The selected clones had randomized zinc finger 
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The results from sequential panning of the randomized zinc finger 3 library revealed five 
binding sequences which recognized the new finger 3 site. The native site, GCG, was 
altered to AAA and the following sequences shown in Table 1 were identified to bind 
AAA. 
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TABLE 1 
BTNP TNr. SEQUENCE 



SEQUENCE ID NO. 17 


RSD ERK RH 1 


SEQUENCE ID NO. 18 


WSI PVL LH 


SEQUENCE ID NO. 19 


WSL LPV LH 


SEQUENCE ID NO. 20 


FSF LLPLH 


SEQUENCE ID NO. 21 


LSTWRGWH 


SEQUENCE ID NO. 22 


TSI QLP YH 



' RSD ERK RH is the native Finger 3 binding sequence. 

,„ order ,0 assess the function., properties of rhe new zinc fingers generated an E co/i 
based in vivo system has heen devised. This sysrent utilizes -wo plasmtds wtth five 
compare replied colEl and ,15. Cytosplanric expression of the zinc finger 
provided by the arabinase promoter in the eolEl plasntid. The p. 5 replica contatntng 
p ,asmid contains a zinc finger binding site in place of the repressor binding sue tn a 
.asmidwhicbczpreaaesuteafiagmcnrof , galaclosidase. Tbeburdingo^ezmcfin^ 
to to site on uae second plastnid sbut,off the producrion of , galaetostdase and thus 
novo, zine finger, can be assessed in this in *. assay for fianction using a eon— 
blue/white section. For ex-ple. in the presence of acinose and lactose, the « 
finger gene is expressed, ua= p-orein product binds ,0 the zinc finger b.„d„g strc «d 
represses Uae lactose prontoler. Wore, no p-galactosidase is produced and wb, 
plaq „es would he present This system which is compare with respect ,0 reshactton 
shea win, pCotntza.5, will faciliuate Ihe rapid characterization of novel finger, 
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Furthermore, this approach could be extended to allow for the genetic selection of novel 
transcriptional regulators. 

Another method of mutagenizing a wild type zinc finger-nucleotide binding protein 
includes segmental shuffling using a PCR technique which allows for the shuffling of 
5 gene segments between collections of genes. Preferably, the genes contain limited 
regions of homology, and at least 15 base pairs of contiguous sequence identity. 
Collections of zinc finger genes in the vector pComb3.5 are used as templates for the PCR 
technique. Four cycles of PCR are performed by denaturation, for example, for 1 min at 
94°C and annealling of 50°C for 15 seconds. In separate experiments PCR is performed 

10 at 94°C 5 1 min, 50°C, 30 sec; 94°, 1 min, 50°, 1 min; 94°, 1 min, 50% 15 sec, 72°, 1 sec. 
All experiments use the same template (a lOng mixture). The experiment is performed 
such that under each condition two sets of reactions are performed. Each set has only a 
top or a bottom strand primer, which leads to the generation of single-stranded DNA's of 
different lengths. For example, FTX3, ZFIF and FZF3 primers may be used in a separate 

1 5 set to give single stranded products. The products from these reactions are then pooled 
and additional 5' and 3' terminal primers (e.g., FTX3 and R3B) are added and the mix is 
subjected to 35 additional rounds of PCR at 94°C, 1 min, 50°, 15 sec, 72°, 1 min 30 sec. 
The resultant mixture may then be cloned by Xho I/Spe I digestion. The new shuffled 
zinc fingers can be selected as described above, by panning a display of zinc fingers on 

20 any genetic package for selection of the optimal zinc-finger collections. This technique 
may be applied to any collection of genes which contain at least 15 bp of contiguous 
sequence identity. Primers may also be doped to a defined extent as described above 
using the NNK example, to introduce mutations in primer binding regions. Reaction 
times may be varied depending on length, of template and number of primers used. 
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FX AMPLE 7 
MODIFICATION OF SPEC IFICITY OF Zif268 

Rea gents. Strains, and Vectors 

Restriction endonucleases were obtained from New England Biolabs or Boehringer 
Mannheim. T4 DNA ligase was the product of GIBCO BRL. Taq polymerase and Vent 
polymerase was purchased from Promega. Heparin-Sepharose CL-6B medium was from 
Pharmacia. Oligonucleotides were from Operon Technologies (Alameda, CA), or 
prepared on a Gene Assembler Plus (Pharmacia LKB) in the laboratory. pZif89 was a gift 
from Drs. Pavletich and Pabo (Pavletich, Science, 252:809-817, 1991). Escherichia coli 
BL21(DE3)pLysS and plasmid pET3a was from Novagen, Escherichia coli XLl-Blue, 
phage VCSM1 3, the phagemid vector P Comb3, and pAraHA are as described (Barbas III, 
et al, Proc. Natl. Acad. Sci. USA, ££7978-7982, 1991; Barbas III, et al, Methods: A 
Companion to Methods in Enzymdlogy, 2:119-124, 1991). 



Plasmid Construction 

Genes encoding wild-type zinc-finger proteins were placed under the control of the 
Salmonella typhimurium araB promoter by insertion of a DNA fragment amplified by the 
polymerase chain reaction (PCR) and containing the wild-type Zif268 gene of pzif89 
(Pavletich, supra) with the addition of multiple restriction sites (Xhol/SacV and 
XbaVSpeY). The resulting plasmid vector was subsequently used for subcloning the 
selected zinc-finger genes for immunoscreening. In this vector the zinc finger protein is 
expressed as a fusion with a hemagglutinin decapeptide tag at its C-terminus which may 
be detected with an anti-decapeptide monoclonal antibody (FIGURE 8A) (Field, et al, 
Mol. & Cell. Biol. &2159-2165, 1988). The Zif268 protein is aligned to show the 
conserved features of each zinc finger. The it-helices and antiparallel p-sheets are 
indicated. Six amino-acid residues underlined in each finger sequence were randomized 
in library constructions. The C-terminal end of Zif268 protein was fused with a fragment 
containing a decapeptide tag. The position of fusion is indicated by an arrow. 
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The phagemid pComb3 was modified by digestion with Nhel and Xbal to remove the 
antibody light chain fragment, filled with Klenow fragment, and the backbone was self- 
ligated, yielding plasmid pComb3.5. The Zif268 PCR fragment was inserted into 
pComb3.5 as above. To eliminate background problems in library construction a 1.1 -kb 
5 nonfunctional stuffer was substituted for the wild-type Zif268 gene using Sacl and Xbal. 
The resulting plasmid was digested by Sacl and Xbal to excise the stuffer and the 
pComb3.5 backbone was gel-purified and served as the vector for library construction. 

Zinc Finger Libraries 

Three zinc-finger libraries were constructed by PCR overlap extension using conditions 

10 previously described in Example 3. Briefly, for finger 1 library primer pairs A (5 f -GTC 
CAT AAG ATT AGC GGA TCC-3') (SEQ. ID NO:29) and Zfl6rb (SEQ. ID NO: 12); 
(where N is A, T, G, or C, and M is A or C), and B (5'-GTG AGC GAG GAA GCG GAA 
GAG-3 f ) (SEQ. ID NO:30) and Zflf (SEQ. ID NO:13) were used to amplify fragments 
of Zif268 gene using plasmid pAia-Zi/268 as a template. Two PCR fragments were 

1 5 mixed at equal molar ratio and the mixture was used as templates for overlap extension. 
The recombinant fragments were then PCR-amplified using primers A and B, and the 
resulting product was digested with Sacl and Xbal and gel purified. For each ligation 
reaction, 280 ng of digested fragment was ligated with 1 .8 fxg of pComb3.5 vector at room 
temperature overnight. Twelve reactions were performed, and the DNA was ethanol- 

20 precipitated and electroporated into E. coli XL 1 -Blue. The libraries of finger 2 and 3 
were constructed in a similar manner except that the PCR primers Zfl6rb and ZF1F used 
in finger 1 library construction were replaced by Zfhsi-B (SEQ. ID NO: 10) and ZF2r6F 
(SEQ. ID NO: 1 1 ) (where K is G or T) for finger 2 library, and by BZF3 (SEQ. ID NO:7) 
and ZF36K (SEQ. ID NO:8) for finger 3 library. In the libraries, six amino-acid residues 

25 corresponding to the a-helix positions -1 , 2, 3, 4, 5, 6 of finger 1 and 3, positions -2,-1, 
1 , 2, 3, 4 of finger 2 were randomized (FIGURE 8 A). 
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fjt Vitm Selection nf Zinc Fingers 

A 34-nucleotide hairpin DNA containing either consensus or altered Zif268 binding site 
was used for zinc-finger selection (FIGURE 8). The consensus binding site is denoted 
as Z268N (5'-CCT Q£Q. ICQ Q£Q CCC TTTT GGG £Q£ ££A QQ£ AGG-3') (SEQ. 
5 ID NO- 31). The altered site for finger 1 is TGT (5'-CCT GCG TGG IQI CCC TTTT 
GGG ACA CAA CGC AGG-3') for finger 2 is TTG (5'-CCT GCG HQ GCG CCC 
TTTT GGG CGC Q6A CGC AGG-3') and for finger 3 is CTG (5'-CCT £ffi TGG GCG 
CCC TTTT GGG CGC CCA £A£ AGG-3'). The oligonucleotide was synthesized with 
a primary n-hexyl amino group at its 5' end. A DNA-BSA conjugate was prepared by 
10 mixing 30 ,M DNA with 3 „M acetylated BSA in a solution containing 100 mM l-(3- 
dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride (EDCI) and 40 mM N- 
hydroxysuccinirnide (NHS) as room temperature for 5-hours or overnight. Ztf268 phage, 
10- colony forming units, in 50 „1 zinc buffer (IQmM Tris-Cl, P H 7.5, 90 mM KC1, 1 
mMMgCl 2 ,90^MZnCl 2 , ImMMgCU and 5 mMDTT) containing 1%BSA wasapphed 
15 t oamicromerwellprecoatedwim4.9MgofDNA-BSAconjugatein25^1PBSbufTer(10 

mM potassium phosphate, 160 mM NaCl, P H 7.4) per well. After 2 hours of incubatmn 
at 37-C the phage was removed and the plate washed once by TBS buffer (50 mM Tns- 
Cl and 150 mM NaCl, P H 7.5) containing 0.5% Tween for the first round of selection. 
The plate was washed 5 times for round 2, and 10 times for further rounds. Bound phage 
20 was extracted with elution buffer (0.1 M HC1, P H 2.2 (adjusted with glycine), and 1% 
BSA), and used in infect E. coli XL 1-Blue cells to produce phage for the subsequent 
selection. 

hmnunflsci^gniag 

Mutant zinc finger genes selected after five or six rounds of panning were subcloned mto 
25 the pAraHA vector using Xhol and Spel restriction sites. Typically, 20 clones were 
screened at a time. Cells were grown at 37«C to late-log phase (OD^O.8-1) in the 6 ml 
SB media (Barbas III, et al, supra) containing 30 „g/ml chloramphenicol. Expressmn 
of zinc-finger proteins was induced with addition of 1% of arabinose. Cells were 
harvested 3 to 12 hours following induction. Cell pellets were resuspended in 600 M l zmc 
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buffer containing 0.5 mM phenylmethylsulfonyl fluoride (PMSF). Cells were lysed with 
6-freeze-thaw cycles and the supernatant was clarified by centrifugation at 12,000g for 
5 minutes. A 50 M l-aliquot of cell supernatant was applied to a microliter well precoated 
with 1 . 1 n% of DNA-BSA conjugate. After 1 hour at 37°C, the plate was washed 1 0 times 
with distilled water, and an alkaline phosphatase conjugated anti-decapeptide antibody 
was added to the plate. After 30 minutes at 37°C, the plate was washed 1 0 times and p- 
nitrophenylphosphate was added. The plate was then monitored with a microplate 
autoreader at 405 nm. 



Overexpression and Purification of Zinc-Finger Proteins 

1 0 Zinc finger proteins were overproduced by using the pET expression system (Studier, et 
al, Methods Enzymol, IM:60-89, 1990). The Zif268 gene was introduced following 
PCR into Ndel and BamlU digested vector pET3a. Subsequently, the Zif268 gene was 
replaced with a 680-bp nonfunctional stuffer fragment. The resulting pET plasmid 
containing the stuffer fragment was used for cloning other zinc-finger genes by replacing 

1 5 the stuffer with zinc-finger genes using Spel and Xhol sites. The pET plasmids encoding 
zinc-finger genes were introduced into BL21(DE3)pLysS by chemical transformation. 
Cells were grown to mid-log phase (OD 600 0.4-0.6) in SB medium containing 50 nglm\ 
carbenicillin and 30 ng/ml chloramphenicol. Protein expression was induced by addition 
of 0.7 mM IPTG to the medium. Typically, 500-ml cultures were harvested three hours 

20 after induction. Cell pellets were resuspended in the zinc buffer containing 1 mM PMSF 
and cell were lysed by sonication for 5 minutes at O'C. Following addition of 6mM 
MgCl 2 , cell lysate were incubated with 1 0 M g/ml DNase I for 20 minutes on ice. Inclusion 
bodies containing zinc finger protein were collected by centrifugation at 25,000g for 30 
minutes and were resuspended and solubilized in 10 ml Zinc buffer containing 6M urea 

25 and 0.5 mM PMSF with gentle mixing for 3 to 12 hours at 4'C. The extract was clarified 
by centrifugation at 30,000g for 30 minutes and filtered through a 0.2-,uni low protein 
binding filter. Total protein extract was applied to a Heparin-Sepharose FPLC column 
(1 .6 x 4.5 cm) equilibrated with zinc buffer. Proteins were eluted with a 0-0.7 M NaCl 
gradient. Fractions containing zinc-finger protein were identified by SDS-PAGE and 
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pooled Protein concentration was determined by the Bradford method using BSA 
(fraction V) as a standard (Bradford, Anal. Biochem., 72:248-254, 1976). The yield of 
purified protein was from 7 to 19 mg/liter of cell culture. Protein was over 90% 
homogeneous as judged by SDS-PAGE. 

5 y;n«»+;r ftnalvsis 

The kinetic constants for the interactions between Zif268 peptides and their DN A targets 
were determined by surface plasmon resonance based analysis using the BIAcore 
instrument (Pharmacia) (Malmqvist, Curr. Opinion in Immune, 5:282-286, 1993). The 
surface of a sensor chip was activated with a mixture of EDCI and NHS for 1 5 minutes. 
10 Then 40 ,1 of affinity purified streptavidin (Pierce), 200 ,g/ml in lOmM sodium acetate 
(PH4.5), was injected at a rate of 5 ,1/min. Typically, 5000-6000 resonance units of 
streptavidin were immobilized on the chip. Excess ester groups were quenched w,th 30 
* of 1M ethanolamine. Oligonucleotides were immobilized onto the chip by injecUon 
of 40 Ml of biotinylated oligonucleotides (50 „g/ml) in 0.3 M of sodium chloride. Usually 
1 5 1 500-3000 resonance units of oligomers were immobilized. The association rate (* J was 
determined by studying the rate of binding of the protein to the surface at 5 deferent 

rate (Jfc off ) was determined by increasing flow rate to 20 ,Vmin after association phase 
The Rvalue is the average of three measurements. The ..and Rvalue were calculated 
20 usingkcore^kmeticsevaluauonsoftware.Theequilibri^ 
deduced from the rate constants. 

EXAMPLES 

pit a nr.MTT) D1SPLAXQ FMQDL FTFD 7INC FINGERS 



25 



T ihrarv Desf ff" Selection 

Phage display of the Zif268 protein was achieved by modification of the phagerntd 
display systtm pComb3 as described in Exampies 2-6. The Zif2«8 sequel from p« 
was tailored by PCR for insertion between the »ol and Spel sites of pCombS.5. As 
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described above in Example 4, insertion at these sites results in the fusion of Zif268 with 
the carboxyl terminal segment of the filamentous phage coat protein III, pill, gene. A 
single panning experiment which consists of incubating the phage displaying the zinc 
finger protein with the target DNA sequence immobilized on a microtiter well followed 
5 by washing, elution, and titering of eluted phage was utilized to examine the functional 
properties of the protein displayed on the phage surface. 

In control experiments, phage displaying Zi£268 were examined in a panning experiment 
to bind a target sequence bearing its consensus binding site or the binding site of the first 
three fingers of TFIIIA. These experiments showed that Zif268 displaying phage bound 

10 the appropriate target DNA sequence 9-fold over the TFIIIA sequence or BSA and 
demonstrated that sequence specific binding of the finger complex is maintained during 
phage display. A 4-fold reduction in phage binding was noted when Zn +2 and DTT were 
not included in the binding buffer. Two reports verify that Zif268 can be displayed on 
the phage surface (Rebar, et al t Science, 263:671-673, 1994; Jamieson, et a!., Biochem., 

15 33:5689-5695, 1994). 

In a similar experiment, the first three fingers of TFIIIA were displayed on the surface of 
phage and also shown to retain specific binding activity. Immobilization of DNA was 
facilitated by the design of stable hairpin sequence which present the duplex DNA target 
of the fingers within a single oligonucleotide which was amino labeled (FIGURE 8B) 
20 (Antao, et ai t Nucleic Acids Research, 12:5901-5905, 1991). The hairpin DNA 
containing the 9-bp consensus binding site (5'-GCGTGGGCG-3\ as enclosed) of wild- 
type Zif268 was used for affinity selection of phage-displayed zinc finger proteins. In 
addition, the 3-bp subsites (boxed) of consensus HIV-1 DNA sequence were substituted 
for wild-type Zif268 3-bp subsites for affinity selection. 

25 The amino linker allowed for covalent coupling of the hairpin sequence to acetylated BSA 
which was then immobilized for selection experiments by adsorption to polystyrene 
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microtiter wells. Biotinylated hairpin sequences worked equally well for selection 
following immobilization to streptavidin coated plate. 

Libraries of each of the three fingers of Zif268 were independently constructed using the 
previously described overlap PCR mutagenesis strategy (Barbas III, et al. Proc. Natl. 
5 Acad. Sci. USA, 22:4457-4461, 1992 and EXAMPLES 2-6). Randomization was limited 
to six positions due to constraints in the size of libraries which can be routinely 
constructed (Barbas III, Cur, Opinion in Biotech, 4:526-530, 1993). Zinc finger protein 
recognition of DNA involves an antiparallel arrangement of protein in the major groove 
ofDNA i e the amino terminal region in involved in 3' contacts with the target sequence 
1 0 whereas'the carboxyl terminal region is involved in 5' contacts (FIGURE 8B). Within a 
given finger/DNA subsite complex, contacts remain antiparallel where in finger 1 of 
Zif268, guanidinium groups of Arg at helix positions -1 and 6 hydrogen bond with the 3' 
and 5' guanines, respectively of the GCG target sequence. Contact with the central base 
in a triplet subsite sequence by the side chain of the helix position 3 residue is observed 
15 in finger 2 of Zif268, fingers 4 and 5 of GLI, and fingers 1 and 2 of TTK. Within the 
three reported crystal structures of zinc-finger/DN A complexes direct base contact has 
been observed between the side-chains of residues -1 to 6 with the exception of 4 
(Pavletich, supra; Pavletich, Science, 251:1701-1707, 1993; Fairall, et al. Nature, 
26^:483-487, 1993). 

Based on these observations, residues corresponding to the helix positions -1, 2, 3, 4, 5, 
and 6 were randomized in the finger 1 and 3 libraries. The Ser of position 1 was 
conserved in these experiments since it is well conserved at this position in zinc finger 
sequences in general and completely conserved in Zif268 (Jacobs, EMBO J., 1±:4507- 
4517, 1992). Inthennger21ibrary,helixpositions-2,-l, 1,2, 3, and 4 were randomized 
25 to explore a different mutagenesis strategy where the -2 position is examined since both 
Zif268 and GLI structures reveal this position to be involved in phosphate contacts and 
since it will have a context effect on the rest of the domain. Residues 5 and 6 were fixed 
the target sequence TTG retained the 5' thymidine of the wild type TGG site. 



20 



since 
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Introduction of ligated DNA by electroporation resulted in the construction of libraries 
consisting of 2 x 10 9 , 6 x 10 8 , and 7 x 10 8 independent transformants for finger libraries 
1 , 2, and 3, respectively. Each library results in the display of the mutagenized finger in 
the context of the two remaining fingers of wild-type sequences. 

EXAMPLE 10 
SEQUENCE ANALYSIS OF SELECTED FINGERS 

In order to examine the potential of modifying zinc-fingers to bind defined targets and to 
examine their potential in gene therapy, a conserved sequence within the HIV-1 genome 
was chosen as a target sequence. The 5' leader sequence of HIV-1 HXB2 clone at 
positions 106 to 121 relative to the transcriptional initiation start site represents one of 
several conserved regions within HTV-1 genomes (Yu, et al t Proa Natl Acad Sci. USA, 
20:6340-6344, 1 993); Myers, et al, 1992). For these experiments, the 9 base pair region, 
1 13 to 121, shown in FIGURE 8B, was targeted. 

Following selection for binding the native consensus or HIV-1 target sequences, 
functional zinc fingers were rapidly identified with an immunoscreening assay. 
Expression of the selected proteins in a pAraHA derivative resulted in the fusion of the 
mutant Zif268 proteins with a peptide tag sequence recognized by a monoclonal antibody 
(FIGURE 8A). Binding was determined in an ELISA format using crude cell lysates. A 
qualitative assessment of specificity can also be achieved with this methodology which 
is sensitive to at least 4-fold differences in affinity. Several positive clones from each 
selection were sequenced and are shown in FIGURE 9. The six randomized residues of 
finger 1 and 3 are at positions -1 , 2, 3, 4, 5, and 6 in the a-helical region, and at -2, -1 , 1 , 
2, 3, and 4 in finger 2 (FIGURE 9). The three nucleotides denote the binding site used 
for affinity selection of each finger. Proteins studied in detail are indicated with a clone 
designation. 
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Finger 1 selection with Ae consensus binding site OCO revealed a snong selection fot 
Lys a, position -1 and Arg a, position 6. Covariation between positions -1 and 2 ,s 
observed to Aree clones wWchconUta Lys and Cys a, these positions respectively. Clone 
C7 was preferentially enriched in tire selection based on to occurrence ,n 3 of Ac 12 
5 clones sequenced. Selection against the HTV-1 targe, sequence in titis regron, TOT, 
revealed a diversity of sequences with a selection for residues with hydrogen-bondrng stde 
chains in position -1 and a modes, selection for Oln a. position 3. Finger 2 selector, 
aeains, tire consensus TOO subsite showed a selection for an aromatic residue a. -1 
whereas selection agate. Ae HIV-1 urge, TTO demonso-ated a selection for a bastc 
,0 residue a. this position. The preference for Ser a, position 3 may be relevant tn tire 
cecognirionofAymidtae. Contact of thymine with Serhasbecn observed in the GLl and 
TTK aruoures (Pavletich, supra: Fairall, « at. supra). OAer modes, selections .owards 
— residues can be observed wiAto Ac table. Selections wee perfonned uuhzntg 
a supE strain of £. call which rented in Ac reading of Ae amber codon TAG as a Gin 
15 during nansUtion. Of Ae51 sequences presented in FIGURE 9, ,4 clones possessed a 
si „g,e amber codon. No clones possessed more Aan one amber codon. Selecon for 
oppression of Ae amber s,op codon in supE suins baa been noted in oAer DNA bindmg 
pr o,ein libraries and likely improves Ae quality of Ae library since tins residue ,s 
frequently uaed as a comae, residue in DNA binding proleins (Huang, « A, ?roc. Natl. 
20 Acad Sci USA, 21:3969.3973, 1994). Selection for flngerc containing free cystetnes ,s 
also noted and likely reflec* Ae experiment protocol. Phage were taeubated to a buffer 
comairung Zn- and DTT to maximise Ae number of phage bearing properly folded 
fmgers. Selection against free cysteines, presumably due .0 aggregation or -nproper 
folding, has been no.ed pmviously to phage display libraries of oAer pro«eins (Lowanan. 
25 elal.. J. MoL Biol., 234:564-578, 1993). 

For furAer chamoterisation, high level expression of zinc finger proteins was achieved 
using Ae T7 promoter (FIGURE )0, (Shadier, e, cL. supra). In FIGURE )0, protems 
were separa.ed by 15% SDS-PAGE and suited wiA Coomassie briilian. blue. Lane . 
Secular weigh, standees). Lane 2: cell exhac, before IPTG induction. Lane): 
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cell extract after IPTG induction. Lane 4: cytoplasmic fraction after removal of inclusion 
bodies by centrifugation. Lane 5: inclusion bodies containing zinc finger peptide. Lane 
6: mutant Zif268 peptide purified by Heparin-Sepharose FPLC. Clones CIO, F8, and G3 
each possessed an amber codon which was converted to CAG to encode for Gin prior to 
5 expression in this system. 

EXAMPLE 11 

CHARACTERI ZATION OF AFFINITY AND SPECIFICITY 

In order to gain insight into the mechanism of altered specificity or affinity, the kinetics 
of binding was determined using real-time changes in surface plasmon resonance (SPR) 

10 (Malmqvist, supra). The kinetic constants and calculated equilibrium dissociation 
constants of 1 1 proteins are shown in FIGURE 1 1 . Each zinc finger protein studied is 
indicated by a clone designation (for its sequence, see FIGURE 9). The target DNA site 
used for selection of each finger is indicated in bold face. The consensus binding site for 
the wild type protein is also shown in bold. The non-hairpin duplex DNA (underlined) 

15 was prepared by annealing two single-stranded DNAs. The k m association rate; k ofr , 
dissociation rate; equilibrium dissociation constant for each protein is given. 

The calculated equilibrium dissociation constants for Zif268 binding to its consensus 
sequence in the form of the designed hairpin or a linear duplex lacking the tetrathymidine 
loop are virtually identical suggesting that the conformation of the duplex sequence 
20 recognized by the protein is not perturbed in conformation within the hairpin. The value 
of 6.5 nM for Zif268 binding to its consensus is in the range of 0.5 to 6 nM reported using 
electrophoretic mobility shift assays for this protein binding to its consensus sequence 
within oligonucleotides of different length and sequence (Pavletich, supra; Rebar, supra; 
Jamieson, et aL, supra). 

25 As a measure of specificity, the affinity of each protein was determined for binding to the 
native consensus sequence and a mutant sequence in which one finger subsite had been 
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changed. FIGURE 1 1 shows the determination of dissociation rate (* otr ) of wild-type 
Zif268 protein (WT) and its variant C7 by real-time changes in surface plasmon 
resonance. The response of the instrument, r, is proportional to [protein-DNA] complex. 
Since dr/dt = k off when [protein] = 0, then * off = In (r„/r J/( V t,), where r m is the response 
5 at time V The results of a single experiment for each protein are shown. Three 
experiments were performed to produce the values shown in FIGURE 11. Clone C7 is 
improved 13-fold in affinity for binding the wild-type sequence GCG. The major 
contribution to this improvement in affinity is a 5-fold slowing of the dissociation rate of 
the complex (FIGURE 12). Specificity of the C7 protein is also improved 9-fold with 
1 0 respect to the HIV-1 target sequence. This result suggest that additional or improved 
contacts are made in the complex. Studies of protein C9 demonstrate a different 
mechanism of improved specificity. In this case the overall affinity of C9 for the GCG 
site is equivalent to Zif268 but the specificity is improved 3-fold over Zif268 for binding 
to the TGT target site by an increase in the off-rate of this complex. Characterization of 
15 proteins F8 and F15 demonstrate that the 3 base pair recognition subsite of finger 1 can 
be completely changed to TGT and that new fingers can be selected to bind this site. 

Characterization of proteins modified in the finger 2 domain and selected to bind the TTG 
subsite reveal the specificity of this finger is amenable to modification. Proteins G4 and 
G6 bind an oligonucleotide bearing the new suc-site with affinities equivalent to Zif268 

20 binding its consensus target. Specificity of these proteins for the target on which they 
were selected to bind is demonstrated by an approximately 4-fold better affinity for this 
oligonucleotide as compared to the native binding site which differs by a single base pair. 
This level of discrimination is similar to that reported for a finger 1 mutant (Jamieson, et 
al, supra). The finger 3 modified protein A14 was selected to bind the native finger 3 

25 subsite and binds this site with an affinity which is only 2-fold lower than Zif268. Note 
that protein A14 differs radically in sequence from the native protein in the recognition 
subsite. Sequence specificity in 10 of the 1 1 proteins characterized was provided by 
differences in the stability of the complex. Only a single protein, G6, achieved specificity 
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by a dramatic change in on-rate. Examination of on-rate variation with charge variation 
of the protein did not reveal a correlation. 

EXAMPLE 12 
DIMERIC ZINC FINGER CONSTRUCTION 

5 Zinc finger proteins of the invention can be manipulated to recognize and bind to 
extended target sequences. For example, zinc finger proteins containing from about 2 to 
12 zinc fingers Zif(2) to Zif(12) may be fused to the leucine zipper domains of the 
Jun/Fos proteins, prototypical members of the bZIP family of proteins (O'Shea, et aL 
Science, 254:539, 1991). Alternatively, zinc finger proteins can be fused to other proteins 
10 which are capable of forming heterodimers and contain dimerization domains. Such 
proteins will be known to those of skill in the art. 

The Jun/Fos leucine zippers preferentially form heterodimers and allow for the 
recognition of 12 to 72 base pairs. Henceforth, Jun/Fos refer to the leucine zipper 
domains of these proteins. Zinc finger proteins are fused to Jun, and independently to Fos 

15 by methods commonly used in the art to link proteins. Following purification, the Zif-Jun 
and Zif-Fos constructs (FIGURE 13 and 14, respectively), the proteins are mixed to 
spontaneously form a Zif-Jun/Zif-Fos heterodimer. Alternatively, coexpression of the 
genes encoding these proteins results in the formation of Zif-Jun/Zif-Fos heterodimers in 
vivo. Fusion with an N-terminal nuclear localization signal allows for targeting of 

20 expression to the nucleus (Calderon, et al t Cell 41:499, 1982). Activation domains may 
also be incorporated into one or each of the leucine zipper fusion constructs to produce 
activators of transcription (Sadowski, et al t Gene, UL£:137, 1992). These dimeric 
constructs then allow for specific activation or repression of transcription. These 
heterodimeric Zif constructs are advantageous since they allow for recognition of 

25 palindromic sequences (if the fingers on both Jun and Fos recognize the same DNA/RNA 
sequence) or extended asymmetric sequences (if the fingers on Jun and Fos recognize 
different DNA/RNA sequences). For example the palindromic sequence 
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5. - GGC CCA CGC JN\ GCG TGG GCG - 3 1 

3. - GCG GGT GCG W x CGC ACC CGC - 5 1 (SEQIDNO:37) 

is recognized by the Zif268-Fos/Zif268 Jun dimer (x is any number). The spacing 
between subsites is determined by the site of fusion of Zif with the Jun or Fos zipper 
5 domains and the length of the linker between the Zif and zipper domains. Subsite spacing 
is determined by a binding site selection method as is common to those skilled in the art 
(Thiesen,e/o/., Nucleic Acids Research, 13:3203, 1990). Example of the recognition of 
an extended asymmetric sequence is shown by Zif(C7) 6 -Jun/Zif-268-Fos dimer. This 
protein consists of 6 fingers of the C7 type (EXAMPLE 1 1) linked to Jun and three 
1 0 fingers of Zif268 linked to Fos, and recognizes the extended sequence: 

5. - CGC CGC CGC CGC CGC CGC (ul GCG TGG GCG - 3' 
3- - GCG GCG GCG GCG GCG GCG InJ x CGC ACC CGC - 5' 
(SEQIDNO.38) 



FX AMPLE 13 

15 rn ^TRTirTION MM TTFTNKFR PROTFTNS I£EH REPEATS 

np THF FTRST FTNC.FR OF ZIF268 

Following mutagenesis and selection of variants of the Zif268 protein in which the finger 
1 specificity or affinity was modified (See EXAMPLE 7), proteins carrying multiple 
copies of the finger may be constructed using the TGEKP linker sequence by methods 
20 known in the art. For example, the C7 finger may be constructed according to the 
scheme: 

MKLLEPYACPVESCDRRFSKSADLKRHIRIFIQESE- 

(YACPVESCDRRFSKSADLKHIRIHIQEKE) where the sequence of the last linker 
is subject to change since it is at the terminus and not involved in linking two fingers 
25 together. An example of a three finger C7 construction is shown in Figure 1 5. This 
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protein binds the designed target sequence GCG-GCG-GCG (SEQ ID NO: 32) in the 
oligonucleotide hairpin CCT-CGC-CGC-CGC-GGG-TTT-TCC-CGC-GCC-CCC GAG 
G with an affinity of 9nM, as compared to an affinity of 300 nM for an oligonucleotide 
encoding the GCG-TGG-GCG sequence (as determined by surface plasmon resonance 

5 studies). Proteins containing 2 to 12 copies of the C7 finger have been constructed and 
shown to have specificity for their predicted targets as determined by ELISA (see for 
example, Example 7). Fingers utilized need not be identical and may be mixed and 
matched to produce proteins which recognize a desired target sequence. These may also 
be utilized with leucine zippers (e.g., Fos/Jun) to produce proteins with extended 

1 0 sequence recognition. 

In addition to producing polymers of finger 1 , the entire three finger Zif268 and modified 
versions therein may be fused using the consensus linker TGEKP to produce proteins with 
extended recognition sites. For example, FIGURE 16 shows the sequence of the protein 
Zif268-Zif268 in which the natural protein has been fused to itself using the TGEKP 
15 linker. This protein now binds the sequence GCG-TGG-GCG-GCG-TGG-GCG as 
demonstrated by ELISA. Therefore modifications within the three fingers of Zif268 may 
be fused together to form a protein which recognizes extended sequences. These new 
zinc proteins may also be used in combination with leucine zippers if desired, as 
described in Example 12. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: THE SCRIPPS RESEARCH INSTITUTE 

(ii) TITLE OF INVENTION: ZINC FINGER PROTEIN DERIVATIVES AND 
METHODS THEREFOR 



(iii) NUMBER OF SEQUENCES: 31 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Spensley Horn Jubas & Lubitz 

10 (B) STREET: 1880 Century Park East, Suite 500 

(C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 90067 



15 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS r DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 



20 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US95/ 

(B) FILING DATE: 18-JAN-1995 

(C) CLASSIFICATION: 



(viii) ATTORNEY/ AGENT INFORMATION : 
25 (A) NAME: Haile, Lisa A., Ph.D. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE /DOCKET NUMBER: FD-4083 



(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 619-4 55-5100 
30 (B) TELEFAX: 619-455-5110 



(2) INFORMATION FOR SEQ ID NO : 1 : 



WO 95/19431 



PCIYUS95/00829 



-89- 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .25 

10 (D) OTHER INFORMATION: /note= "Where Tyr appears, Tyr can 

also be Phe" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Tyr Xaa Cys Xaa Xaa Cys Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu 
15 10 15 

15 xaa Xaa His Xaa Xaa Xaa His Xaa Xaa 

20 25 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 36 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
25 (B) CLONE: ZF 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .36 



30 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ATGAAACTGC TCGAGCCCTA TGCTTGCCCT GTCGAG 



36 
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(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZR 

10 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..4 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GAGGAGGAGG AGACTAGTGT CCTTCTGTCT TAAATGGATT TTGGT 4 
15 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: zif 268Xho-Spe 

(ix) FEATURE: 
25 (A) NAME/KEY: CDS 

(B) LOCATION: 1..273 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
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CTC GAG CCC TAT GCT TGC CCT GTC GAG TCC TGC GAT CGC CGC TTT TCT 4 8 
Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 
! 5 10 15 

CGC TCG GAT GAG CTT ACC CGC CAT ATC CGC ATC CAC ACA GGC CAG AAG 96 
5 Arg Ser Asp Glu Leu Thr Arg His lie Arg lie His Thr Gly Gin Lys 

20 25 30 

CCC TTC CAG TGT CGA ATA TGC ATG CGT AAC TTC AGT CGT AGT GAC CAC 144 
Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 
35 40 45 

10 CTT ACC ACC CAC ATC CGC ACC CAC ACA GGC GAG AAG CCT TTT GCC TGT 192 

Leu Thr Thr His He Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 
50 55 60 

GAC ATT TGT GGG AGG AAG TTT GCC AGG AGT GAT GAA CGC AAG AGG CAT 240 
Asp He Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 
15 65 70 75 80 

ACC AAA ATC CAT TTA AGA CAG AAG GAC ACT AGT 273 
Thr Lys He His Leu Arg Gin Lys Asp Thr Ser 
85 90 



(2) INFORMATION FOR SEQ ID NO: 5: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Leu Glu Pro Tyr Ala Cys Pro Val Glu Ser Cys Asp Arg Arg Phe Ser 
15 10 15 

Arg Ser Asp Glu Leu Thr Arg His He Arg He His Thr Gly Gin Lys 
20 25 30 



30 Pro Phe Gin Cys Arg He Cys Met Arg Asn Phe Ser Arg Ser Asp His 

35 40 45 
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Leu Thr Thr His lie Arg Thr His Thr Gly Glu Lys Pro Phe Ala Cys 
50 55 60 

Asp lie Cys Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg Lys Arg His 
65 70 75 80 

5 Thr Lys lie His Leu Arg Gin Lys Asp Thr Ser 

85 90 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
10 <B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE : 
15 (B) CLONE: FTX3 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1 . . 22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
20 G C AATTAACC CTCACTAAAG GG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: BZF3 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .21 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
5 GGCAAACTTC CTCCCACAAA T 21 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZF36K 

15 (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ATTTGTGGGA GGAAGTTTGC CNNKAGTNNK NNKNNKNNKN NKCATACCAA AATCCATTTA 60 



20 (2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE : 
(B) CLONE: R3B 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTGATATTCA CAAACGAATG G 21 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 (vii) IMMEDIATE SOURCE: 

(B ) CLONE: ZFNsi-B 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..21 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CATGCATATT CGACACTGGA A 21 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 66 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vii) IMMEDIATE SOURCE : 

(B) CLONE: ZF2r6F 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
<B) LOCATION: 1. .66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CAGTGTCGAA TATGCATGCG TAACTTCNNK NNKNNKNNKN NKNNKACCAC CCACATCCGC 60 
ACCCAC 66 
(2) INFORMATION FOR SEQ ID NO : 12 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: AFI6rb 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
20 (B) LOCATION: 1..66 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGGCCTGTG TGGATGCGGA TATGMNNMNN MNNMNNMNNC GAMNNAGAAA AGCGGCGATC 60 
GCAGGA 66 
(2) INFORMATION FOR SEQ ID NO: 13: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: ZFIF 

5 (ix) FEATURE: 

(A) NAME /KEY: CDS 
<B) LOCATION: 1..24 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CATATCCGCA TCCACACAGG CCAG 
10 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Arg Ser Asp Glu Leu Thr Arg His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .6 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Ser Arg Ser Asp His Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



15 (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CGTAAATGGG CGCCCTTTTG GGCGCCCATT TACG 
20 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .8 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

5 Arg Ser Asp Glu Arg Lys Arg His 

1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
10 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
15 (A) NAME /KEY: Peptide 

(B) LOCATION: 1. .8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Trp Ser lie Pro Val Leu Leu His 
1 5 

20 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1. .8 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Trp Ser Leu Leu Pro Val Leu His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME / KEY : Peptide 

(B) LOCATION: 1..8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

15 Phe Ser Phe Leu Leu Pro Leu His 

1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
20 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 
25 (A) NAME/KEY: Peptide 

(B) LOCATION: 1..8 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
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Leu Ser Thr Trp Arg Gly Trp His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 (ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Thr Ser lie Gin Leu Pro Tyr His 
15 1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .61 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TGATCTCAGA AG CCAAGCAG GGTCGGGCCT GGTTAGTACT TGGATGGGAG ACCGCCTGGG 60 



A 



61 
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(2) INFORMATION FOR SEQ ID NO: 24: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i) 



(ix) FEATURE: 

(A) NAME / KEY : Peptide 
10 (B) LOCATION: 1. .26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Tyr lie Cys Ser Phe Ala Asp Cys Gly Ala Ala Tyr Asn Lys Asn Trp 
1 5 10 15 

Lys Leu Gin Ala His Leu Cys Lys His Thr 
15 20 25 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

20 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
25 (B) LOCATION: 1..26 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

Phe Pro Cys Lys Glu Glu Gly Cys Glu Lys Gly Phe Thr Ser Leu His 
15 10 15 
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His Leu Thr Arg His Ser Leu Thr His Thr 
20 25 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS; 
5 (A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

10 (ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .26 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Phe Thr Cys Asp Ser Asp Gly Cys Asp Leu Arg Phe Thr Thr Lys Ala 
15 1 5 io 15 

Asn Met Lys Lys His Phe Asn Arg Phe His 
20 25 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) ■ 

25 (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 13 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
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TGGATGGGAG ACC 13 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix> FEATURE: 
10 (A) NAME /KEY : Peptide 

(B) LOCATION: 1. .8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Arg Ser Asp Glu Arg Lys Arg His 
1 5 



15 (2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .21 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GTCCATAAGA TTAGCGGATC C 21 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..21 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
10 GTGAGCGAGG AAGCGGAAGA G 



(2) INFORMATION FOR SEQ ID NO: 31: 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 34 



(Xi) SEQUENCE DESCRIPTION: SEQ "iD NO: 31: 
CCTGCGTGGG CGCCCTTTTG GGCGCCCACG CAGG 



(i) 



15 



Although the invention has been described with reference to the presently preferred 
embodiment, it should be understood that various modifications can be made without 
departing from the spirit of the invention. Accordingly, the invention is limited only by 
the following claims. 
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CLAIMS 

1. An isolated zinc finger-nucleotide binding polypeptide variant comprising at 
least two zinc finger modules that bind to a cellular nucleotide sequence and 
modulate the function of the cellular nucleotide sequence. 

2. The variant of claim 1 , wherein the modulation is enhancement of transcription 
of a gene operatively linked to the cellular nucleotide sequence. 

3. The variant of claim 1. wherein the modulation is suppression of transcription 
of a gene operatively linked to the cellular nucleotide sequence. 

4. The variant of claim 1 , which is derived from a zinc finger-nucleotide binding 
polypeptide selected from the group consisting of zif 268 and TFIIIA. 

5. The variant of claim 1 , wherein the cellular nucleotide sequence is DNA. 

6. The variant of claim 1 , wherein the cellular nucleotide sequence is RNA. 

7. The variant of claim 1 , wherein the po! peptide contains a linker region between 
zinc fingers consisting essentially of the amino acid sequence TGEKP. 

8. The variant of claim 1, wherein the cellular nucleotide sequence is a structural 
gene nucleotide sequence. 

9. The variant of claim 1 , wherein the cellular nucleotide sequence is a promoter 
nucleotide sequence. 

1 0. The variant of claim 9, wherein the promoter is an onco-promoter. 
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The variant of claim 10. wherein the promoter is a viral promoter. 

The variant of claim 1, wherein the cellular nucleotide sequence is a retroviral 
nucleotide sequence. 

The variant of claim 12, wherein the retrovirus is a human T-cell lymphotrophic 
virus (HTLV). 

The variant of claim 13, wherein the retrovirus is HTLV-1 or HTLV-2. 

The variant of claim 12, wherein the retrovirus is a human immunodeficiency 
virus (HIV). 

The variant of claim 15, wherein the retrovirus is HIV-1 or HTV-2. 

The variant of claim 1, wherein the cellular nucleotide sequence is an oncogene 
nucleotide sequence. 

The variant of claim 1, wherein the cellular nucleotide sequence is a plant 
cellular nucleotide sequence. 

A nucleotide sequence encoding a zinc fmger-nucleotide binding polypeptide 
variant of claim 1 . 

A recombinant expression vector containing the zinc fmger-nucleotide binding 
polypeptide variant of claim 1. 
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A pharmaceutical composition comprising a therapeutically effective amount of 
a zinc finger-nucleotide binding polypeptide derivative or a therapeutically 
effective amount of a nucleotide sequence which encodes a zinc finger- 
nucleotide binding polypeptide derivative, wherein the derivative binds to a 
cellular nucleotide sequence to modulate the function of the cellular nucleotide 
sequence, in combination with a pharmaceutical^ acceptable carrier. 

The pharmaceutical composition of claim 21, wherein the modulation is 
enhancement of transcription of a gene operatively linked to the cellular 
nucleotide sequence. 

The pharmaceutical composition of claim 21, wherein the modulation is 
suppression of transcription of a gene operatively linked to the cellular 
nucleotide sequence. 

The pharmaceutical composition of claim 21, wherein the zinc finger-nucleotide 
binding polypeptide derivative is a truncated wild-type zinc finger-nucleotide 
binding domain. 

The pharmaceutical composition of claim 21, wherein the zinc finger binding 
polypeptide derivative is a variant polypeptide. 

A method for inhibiting a cellular nucleotide sequence comprising a zinc finger- 
nucleotide binding motif, the method comprising contacting the motif with an 
effective amount of a zinc finger-nucleotide binding polypeptide derivative 
which binds the motif. 

The method of claim 26, wherein the zinc finger binding polypeptide derivative 
is a truncated zinc finger protein. 
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28. The method of claim 26, wherein the zinc finger polypeptide derivative is a 
variant polypeptide. 

29. The method of claim 26, wherein the cellular nucleotide sequence is DNA. 

30. The method of claim 26, wherein the cellular nucleotide sequence is RNA. 

3 1 . The method of claim 26, wherein the cellular nucleotide sequence is a structural 
gene nucleotide sequence. 

32. The method of claim 26, wherein the cellular nucleotide sequence is a promoter 
nucleotide sequence. 

33. The method of claim 26, wherein the cellular nucleotide sequence is an 
oncogene nucleotide sequence. 

34. The method of claim 26, wherein the cellular nucleotide sequence is a plant 
cellular nucleotide sequence. 

35. The method of claim 3 1 , wherein the zinc finger binding polypeptide derivative 
is a variant polypeptide. 
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36. A method for obtaining an isolated zinc fmger-nucleotide binding polypeptide 
variant which binds to a cellular nucleotide sequence comprising: 

a) identifying the amino acids in a zinc finger-nucleotide binding 
polypeptide that bind to a first cellular nucleotide sequence and modulate 

5 the function of the nucleotide sequence; 

b) creating an expression library encoding the polypeptide variant 
containing randomized substitution of the amino acids identified in step 
a) above; 

c) expressing the library in a suitable host cell; and 

10 d) isolating a clone that produces a polypeptide variant that binds to a 

second cellular nucleotide sequence and modulates the function of the 
second nucleotide sequence. 

37. The method of claim 36, wherein the library is expressed in a phage surface 
expression system. 



38. The method of claim 36, wherein the phage expression system includes a 
reducing reagent which allows folding of expression products on the phage 
surface. 



39. The method of claim 38, wherein the reducing reagent is dithiothreitol. 

40. The method of claim 36, wherein the library is randomized by PCR using 
primers containing degenerate triplet codons at sequence locations 
corresponding to the determined amino acids. 

41 . The method of claim 36, wherein the modulation of function is enhancement of 
transcription of a gene operatively linked to the cellular nucleotide sequence. 
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42. The method of claim 36, wherein the modulation of function is suppression of 
transcription of a gene operatively linked to the cellular nucleotide sequence. 

43. The method of claim 36, wherein the cellular nucleotide sequence is DNA. 

44. The method of claim 36, wherein the cellular nucleotide sequence is RNA. 

45. A zinc finger-nucleotide binding polypeptide variant produced by the method 
of claim 36. 

46. A method of treating a subject with a cell proliferative disorder, wherein the 
disorder is associated with the modulation of a cellular nucleotide sequence 
associated with a zinc finger-nucleotide binding motif, comprising contacting 
the zinc finger-nucleotide binding motif with an effective amount of a zinc 
finger-nucleotide binding polypeptide derivative that binds to the zinc finger- 
nucleotide binding motif to modulate activity of the cellular nucleotide 
sequence. 

47. The method of claim 46, wherein an expression vector comprising a 
polynucleotide sequence encoding a zinc finger-nucleotide binding polypeptide 
derivative is introduced into the cells of the subject. 

48. The method of claim 47, wherein the expression vector is a vims. 

49. The method of claim 46, wherein the modulation is enhancement of transcription 
ofa gene operatively linked to the cellular nucleotide sequence. 

50. The method of claim 46, wherein the modulation is suppression of transcription 
ofa gene operatively linked to the cellular nucleotide sequence. 
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51. A method for identifying a protein which modulates the function of a cellular 
nucleotide sequence and binds to a zinc finger-nucleotide binding motif 
comprising: 

a) incubating components comprising a nucleotide sequence encoding the 
putative protein operably linked to a first inducible promoter, and a 
reporter gene operably linked to a second inducible promoter and a zinc 
finger-nucleotide binding motif, wherein the incubating is carried out 
under conditions sufficient to allow the components to interact; and 

b) measuring the effect of the putative modulating protein on the expression 
of the reporter gene. 

52. The method of claim 51, wherein the modulation is inhibition of gene 
expression. 

53. The method of claim 51, wherein the modulation is enhancement of gene 
expression. 

54. The method of claim 5 1 , wherein the first inducible promoter is the arabinase 
promoter. 

55. The method of claim 5 1 , wherein the second inducible promoter is the lactose 
promoter. 

56. The method of claim 5 1 , wherein the incubating is performed in vitro. 

57. The method of claim 5 1 , wherein the incubating is performed in vivo. 

58. The method of claim 5 1 , wherein the reporter gene is p-galactosidase. 
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